TECHNICAL FIELD

The present invention relates generally to therapy and diagnosis of cancer, 
such as colon cancer. The invention is more specifically related to polypeptides comprising 
at least a portion of a colon tumor protein, and to polynucleotides encoding such 
polypeptides. Such polypeptides and polynucleotides may be used in vaccines and 
pharmaceutical compositions for prevention and treatment of colon cancer, and for the 
diagnosis and monitoring of such cancers. 

BACKGROUND OF THE INVENTION 

Cancer is a significant health problem throughout the world. Although 
advances have been made in detection and therapy of cancer, no vaccine or other 
universally successful method for prevention or treatment is currently available. Current
therapies, which are generally based on a combination of chemotherapy or surgery and
radiation, continue to prove inadequate in many patients. 

Colon cancer is the second most frequently diagnosed malignancy in the 
United States as well as the second most common cause of cancer death. An estimated 
95,600 new cases of colon cancer will be diagnosed in 1998, with an estimated 47,700
deaths. The five-year survival rate for patients with colorectal cancer detected in an early 
localized stage is 92%; unfortunately, only 37% of colorectal cancer is diagnosed at this 
stage. The survival rate drops to 64% if the cancer is allowed to spread to adjacent organs 
or lymph nodes, and to 7% in patients with distant metastases. 

The prognosis of colon cancer is directly related to the degree of penetration
of the tumor through the bowel wall and the presence or absence of nodal involvement;
consequently, early detection and treatment are especially important. Currently, diagnosis
is aided by the use of screening assays for fecal occult blood, sigmoidoscopy, colonoscopy
and double contrast barium enemas. Treatment regimens are determined by the type and
stage of the cancer, and include surgery, radiation therapy and/or chemotherapy.
Recurrence following surgery (the most common form of therapy) is a major problem and
is often the ultimate cause of death. In spite of considerable research into therapies for
these and other cancers, colon cancer remains difficult to diagnose and treat effectively.
Accordingly, there is a need in the art for improved methods for detecting and treating
such cancers. The present invention fulfills these needs and further provides other
related advantages.

SUMMARY OF THE INVENTION 

Briefly stated, the present invention provides compositions and methods for
the diagnosis and therapy of cancer, such as colon cancer. In one aspect, the present
invention provides polypeptides comprising at least a portion of a colon tumor protein, or a
variant thereof. Certain portions and other variants are immunogenic, such that the ability
of the variant to react with antigen-specific antisera is not substantially diminished. Within
certain embodiments, the polypeptide comprises a sequence that is encoded by a
polynucleotide sequence selected from the group consisting of: (a) sequences recited in
SEQ ID NO: 1-121, 123-197, 205-630, 632-684, 686, 690-691 and 694-1081; (b) variants
of a sequence recited in SEQ ID NO: 1-121, 123-197, 205-630, 632-684, 686, 690-691
and 694-1081; and (c) complements of a sequence of (a) or (b).
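
By way of illustration only, the group of recited polynucleotide sequence identifiers may be handled programmatically as a set of integer ranges. The following minimal Python sketch expands the range list recited above and tests whether a given identifier falls within it. The function names `parse_seq_id_ranges` and `is_recited` are hypothetical and form no part of the disclosure.

```python
def parse_seq_id_ranges(spec):
    """Expand a range list such as '1-121, 123-197, 686' into a set of integers."""
    ids = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            low, high = (int(x) for x in part.split("-"))
            ids.update(range(low, high + 1))  # ranges are inclusive at both ends
        else:
            ids.add(int(part))
    return ids


# Range list copied from group (a) of the Summary.
RECITED = parse_seq_id_ranges(
    "1-121, 123-197, 205-630, 632-684, 686, 690-691, 694-1081"
)


def is_recited(seq_id):
    """Return True if seq_id is among the recited polynucleotide identifiers."""
    return seq_id in RECITED
```

Identifiers omitted from the list, such as SEQ ID NO: 122 (a predicted amino acid sequence), are reported as not recited by this sketch.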

The present invention further provides polynucleotides that encode a 
polypeptide as described above, or a portion thereof (such as a portion encoding at least 15 
amino acid residues of a colon tumor protein), expression vectors comprising such 
polynucleotides and host cells transformed or transfected with such expression vectors.

Within other aspects, the present invention provides pharmaceutical
compositions comprising a polypeptide or polynucleotide as described above and a 
physiologically acceptable carrier. 

Within a related aspect of the present invention, vaccines are provided. 
Such vaccines comprise a polypeptide or polynucleotide as described above and an 
immunostimulant.

The present invention further provides pharmaceutical compositions that 
comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to a 
colon tumor protein; and (b) a physiologically acceptable carrier. 

Within further aspects, the present invention provides pharmaceutical 
compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as
described above and (b) a pharmaceutically acceptable carrier or excipient. Antigen
presenting cells include dendritic cells, macrophages, monocytes, fibroblasts and B cells. 

Within related aspects, vaccines are provided that comprise: (a) an antigen 
presenting cell that expresses a polypeptide as described above and (b) an 
immunostimulant.

The present invention further provides, in other aspects, fusion proteins that 
comprise at least one polypeptide as described above, as well as polynucleotides encoding 
such fusion proteins.

Within related aspects, pharmaceutical compositions comprising a fusion
protein, or a polynucleotide encoding a fusion protein, in combination with a 
physiologically acceptable carrier are provided. 

Vaccines are further provided, within other aspects, that comprise a fusion 
protein, or a polynucleotide encoding a fusion protein, in combination with an
immunostimulant. 

Within further aspects, the present invention provides methods for inhibiting 
the development of a cancer in a patient, comprising administering to a patient a 
pharmaceutical composition or vaccine as recited above. 

The present invention further provides, within other aspects, methods for
removing tumor cells from a biological sample, comprising contacting a biological sample 
with T cells that specifically react with a colon tumor protein, wherein the step of 
contacting is performed under conditions and for a time sufficient to permit the removal of 
cells expressing the protein from the sample. 

Within related aspects, methods are provided for inhibiting the development
of a cancer in a patient, comprising administering to a patient a biological sample treated as 
described above. 

Methods are further provided, within other aspects, for stimulating and/or 
expanding T cells specific for a colon tumor protein, comprising contacting T cells with 
one or more of: (i) a polypeptide as described above; (ii) a polynucleotide encoding such a
polypeptide; and/or (iii) an antigen presenting cell that expresses such a polypeptide; under 
conditions and for a time sufficient to permit the stimulation and/or expansion of T cells. 
Isolated T cell populations comprising T cells prepared as described above are also 
provided. 

Within further aspects, the present invention provides methods for inhibiting
the development of a cancer in a patient, comprising administering to a patient an effective 
amount of a T cell population as described above. 

The present invention further provides methods for inhibiting the 
development of a cancer in a patient, comprising the steps of: (a) incubating CD4+ and/or
CD8+ T cells isolated from a patient with one or more of: (i) a polypeptide comprising at
least an immunogenic portion of a colon tumor protein; (ii) a polynucleotide encoding such
a polypeptide; and (iii) an antigen-presenting cell that expresses such a polypeptide; and (b)
administering to the patient an effective amount of the proliferated T cells, and thereby
inhibiting the development of a cancer in the patient. Proliferated cells may, but need not,
be cloned prior to administration to the patient. 

Within further aspects, the present invention provides methods for 
determining the presence or absence of a cancer in a patient, comprising: (a) contacting a 
biological sample obtained from a patient with a binding agent that binds to a polypeptide
as recited above; (b) detecting in the sample an amount of polypeptide that binds to the
binding agent; and (c) comparing the amount of polypeptide with a predetermined cut-off 
value, and therefrom determining the presence or absence of a cancer in the patient. Within 
preferred embodiments, the binding agent is an antibody, more preferably a monoclonal 
antibody. The cancer may be colon cancer. 
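
Step (c) of the foregoing method, the comparison of a detected amount with a predetermined cut-off value, may be sketched as follows. This passage does not specify how the cut-off value is established; the sketch assumes, purely for illustration, a cut-off equal to the mean signal of cancer-free control samples plus three standard deviations. The function names and the numeric values are hypothetical.

```python
from statistics import mean, stdev


def cutoff_from_controls(control_signals, n_sd=3.0):
    """Assumed rule (not stated in this passage): mean of controls plus n_sd sample SDs."""
    return mean(control_signals) + n_sd * stdev(control_signals)


def exceeds_cutoff(sample_signal, cutoff):
    """Step (c): compare the detected amount with the predetermined cut-off value."""
    return sample_signal > cutoff


# Hypothetical assay readings from cancer-free control samples.
controls = [0.10, 0.12, 0.08, 0.11, 0.09]
cutoff = cutoff_from_controls(controls)
```

Under this assumed rule, a sample whose signal exceeds `cutoff` would be scored as positive, and any other sample as negative.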

The present invention also provides, within other aspects, methods for
monitoring the progression of a cancer in a patient. Such methods comprise the steps of:
(a) contacting a biological sample obtained from a patient at a first point in time with a
binding agent that binds to a polypeptide as recited above; (b) detecting in the sample an
amount of polypeptide that binds to the binding agent; (c) repeating steps (a) and (b) using
a biological sample obtained from the patient at a subsequent point in time; and (d)
comparing the amount of polypeptide detected in step (c) with the amount detected in step
(b) and therefrom monitoring the progression of the cancer in the patient.
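
The comparison of step (d) may be illustrated by the following minimal Python sketch, which reports only the direction of change between the amount detected at the first point in time and the amount detected at the subsequent point in time. The function name `progression_trend` and the `tolerance` parameter (an allowance for assay noise) are assumptions made for illustration; how a given trend is interpreted clinically is not addressed by the sketch.

```python
def progression_trend(first_amount, later_amount, tolerance=0.0):
    """Compare amounts detected at two time points and report the direction of change.

    tolerance is a hypothetical allowance for assay variability: differences
    whose magnitude does not exceed it are reported as "unchanged".
    """
    delta = later_amount - first_amount
    if delta > tolerance:
        return "increased"
    if delta < -tolerance:
        return "decreased"
    return "unchanged"
```

For example, `progression_trend(1.0, 1.5)` reports an increase in the detected amount between the two samples.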

The present invention further provides, within other aspects, methods for 
determining the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sample obtained from a patient with an oligonucleotide that
hybridizes to a polynucleotide that encodes a colon tumor protein; (b) detecting in the 
sample a level of a polynucleotide, preferably mRNA, that hybridizes to the 
oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the 
oligonucleotide with a predetermined cut-off value, and therefrom determining the presence
or absence of a cancer in the patient. Within certain embodiments, the amount of mRNA is
detected via polymerase chain reaction using, for example, at least one oligonucleotide 
primer that hybridizes to a polynucleotide encoding a polypeptide as recited above, or a 
complement of such a polynucleotide. Within other embodiments, the amount of mRNA is 
detected using a hybridization technique, employing an oligonucleotide probe that 
hybridizes to a polynucleotide that encodes a polypeptide as recited above, or a 
complement of such a polynucleotide. 
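
The relationship between an oligonucleotide and the polynucleotide (or complement) to which it hybridizes may be illustrated by the following minimal Python sketch. It assumes perfect Watson-Crick complementarity over the full length of the oligonucleotide, which is a simplification; hybridization in practice depends on stringency conditions and may tolerate mismatches. The function names and the example sequences are hypothetical and are not taken from the sequence listing.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def hybridization_sites(oligo, target):
    """Return 0-based start positions on target where oligo is perfectly complementary.

    Both sequences are written 5' to 3'. The oligo pairs with the target strand
    wherever the target contains the oligo's reverse complement.
    """
    site = reverse_complement(oligo).upper()
    target = target.upper()
    positions = []
    start = target.find(site)
    while start != -1:
        positions.append(start)
        start = target.find(site, start + 1)
    return positions
```

An oligonucleotide designed against the complement of a polynucleotide is located in the same manner by passing `reverse_complement(target)` as the target.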

In related aspects, methods are provided for monitoring the progression of a 
cancer in a patient, comprising the steps of: (a) contacting a biological sample obtained 
from a patient with an oligonucleotide that hybridizes to a polynucleotide that encodes a 
colon tumor protein; (b) detecting in the sample an amount of a polynucleotide that 
hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) using a biological sample 
obtained from the patient at a subsequent point in time; and (d) comparing the amount of 
polynucleotide detected in step (c) with the amount detected in step (b) and therefrom 
monitoring the progression of the cancer in the patient. 

Within further aspects, the present invention provides antibodies, such as 
monoclonal antibodies, that bind to a polypeptide as described above, as well as diagnostic 
kits comprising such antibodies. Diagnostic kits comprising one or more oligonucleotide 
probes or primers as described above are also provided. 

These and other aspects of the present invention will become apparent upon 
reference to the following detailed description and attached figures. All references 
disclosed herein are hereby incorporated by reference in their entirety as if each was 
incorporated individually. 

SEQUENCE IDENTIFIERS 

SEQ ID NO: 1 is a first determined cDNA sequence for Contig 1, showing 
homology to Neutrophil Gelatinase Associated Lipocalin. 

SEQ ID NO: 2 is the determined cDNA sequence for Contig 2, showing no 
significant homology to any known genes.

SEQ ID NO: 3 is the determined cDNA sequence for Contig 4, showing homology
to Carcinoembryonic antigen.

SEQ ID NO: 4 is the determined cDNA sequence for Contig 5, showing homology
to Carcinoembryonic antigen.

SEQ ID NO: 5 is the determined cDNA sequence for Contig 9, showing homology
to Carcinoembryonic antigen.

SEQ ID NO: 6 is the determined cDNA sequence for Contig 52, showing homology
to Carcinoembryonic antigen.

SEQ ID NO: 7 is the determined cDNA sequence for Contig 6, showing homology
to Villin.

SEQ ID NO: 8 is the determined cDNA sequence for Contig 8, showing no 
significant homology to any known genes. 

SEQ ID NO: 9 is the determined cDNA sequence for Contig 10, showing homology
to Transforming Growth Factor (BIGH3).

SEQ ID NO: 10 is the determined cDNA sequence for Contig 19, showing homology
to Transforming Growth Factor (BIGH3).

SEQ ID NO: 11 is the determined cDNA sequence for Contig 21, showing homology
to Transforming Growth Factor (BIGH3).

SEQ ID NO: 12 is the determined cDNA sequence for Contig 11, showing homology
to CO-029.

SEQ ID NO: 13 is the determined cDNA sequence for Contig 55, showing homology
to CO-029.

SEQ ID NO: 14 is the determined cDNA sequence for Contig 12, showing homology
to Chromosome 17, clone hRPC.1171_I_10, also referred to as C798P.

SEQ ID NO: 15 is the determined cDNA sequence for Contig 13, showing no
significant homology to any known gene.

SEQ ID NO: 16 is the determined cDNA sequence for Contig 14, also referred to as 
14261, showing no significant homology to any known gene.

SEQ ID NO: 17 is the determined cDNA sequence for Contig 15, showing
homology to Ets-Related Transcription Factor (ERT). 

SEQ ID NO: 18 is the determined cDNA sequence for Contig 16, showing 
homology to Chromosome 5, PAC clone 228g9 (LBNL H142). 

SEQ ID NO: 19 is the determined cDNA sequence for Contig 24, showing 
homology to Chromosome 5, PAC clone 228g9 (LBNL H142). 

SEQ ID NO: 20 is the determined cDNA sequence for Contig 17, showing 
homology to Cytokeratin. 

SEQ ID NO: 21 is the determined cDNA sequence for Contig 18, showing 
homology to LI-Cadherin.

SEQ ID NO: 22 is the determined cDNA sequence for Contig 20, showing no 
significant homology to any known gene. 

SEQ ID NO: 23 is the determined cDNA sequence for Contig 22, showing 
homology to Bumetanide-sensitive Na-K-Cl cotransporter (NKCC1). 

SEQ ID NO: 24 is the determined cDNA sequence for Contig 23, showing no 
significant homology to any known gene. 

SEQ ID NO: 25 is the determined cDNA sequence for Contig 25, showing 
homology to Macrophage Inflammatory Protein 3 alpha. 

SEQ ID NO: 26 is the determined cDNA sequence for Contig 26, showing 
homology to Laminin. 

SEQ ID NO: 27 is the determined cDNA sequence for Contig 48, showing
homology to Laminin. 

SEQ ID NO: 28 is the determined cDNA sequence for Contig 27, showing 
homology to Myotubularin (MTM1).

SEQ ID NO: 29 is the determined cDNA sequence for Contig 28, showing 
homology to Chromosome 16 BAC clone CIT987SK-A-363E6. 

SEQ ID NO: 30 is the determined cDNA sequence for Contig 29, also referred to as 
C751P and 14247, showing no significant homology to any known gene, but partial 
homology to Rat GSK-3β-interacting protein Axil homolog.

SEQ ID NO: 31 is the determined cDNA sequence for Contig 30, showing
homology to Zinc Finger Transcription Factor (ZNF207). 

SEQ ID NO: 32 is the determined cDNA sequence for Contig 31, showing no 
significant homology to any known gene, but partial homology to Mus musculus GOB-4 
homolog.

SEQ ID NO: 33 is the determined cDNA sequence for Contig 35, showing no 
significant homology to any known gene, but partial homology to Mus musculus GOB-4 
homolog. 

SEQ ID NO: 34 is the determined cDNA sequence for Contig 32, showing no 
significant homology to any known gene.

SEQ ID NO: 35 is the determined cDNA sequence for Contig 34, showing 
homology to Desmoglein 2. 

SEQ ID NO: 36 is the determined cDNA sequence for Contig 36, showing no 
significant homology to any known gene.

SEQ ID NO: 37 is the determined cDNA sequence for Contig 37, showing
homology to Putative Transmembrane Protein.

SEQ ID NO: 38 is the determined cDNA sequence for Contig 38, also referred to as 
C796P and 14219, showing no significant homology to any known gene. 

SEQ ID NO: 39 is the determined cDNA sequence for Contig 40, showing 
homology to Nonspecific Cross-reacting Antigen.

SEQ ID NO: 40 is the determined cDNA sequence for Contig 41, also referred to as 
C799P and 14308, showing no significant homology to any known gene. 

SEQ ID NO: 41 is the determined cDNA sequence for Contig 42, also referred to as
C794P and 14309, showing no significant homology to any known gene.

SEQ ID NO: 42 is the determined cDNA sequence for Contig 43, showing
homology to Chromosome 1 specific transcript KIAA0487.

SEQ ID NO: 43 is the determined cDNA sequence for Contig 45, showing 
homology to hMCM2.

SEQ ID NO: 44 is the determined cDNA sequence for Contig 46, showing
homology to ETS2. 

SEQ ID NO: 45 is the determined cDNA sequence for Contig 49, showing 
homology to Pump-1. 

SEQ ID NO: 46 is the determined cDNA sequence for Contig 50, also referred to as 
C792P and 18323, showing no significant homology to any known gene. 

SEQ ID NO: 47 is the determined cDNA sequence for Contig 51, also referred to as 
C795P and 14317, showing no significant homology to any known gene. 

SEQ ID NO: 48 is the determined cDNA sequence for 11092, showing no 
significant homology to any known gene. 

SEQ ID NO: 49 is the determined cDNA sequence for 11093, showing no 
significant homology to any known gene. 

SEQ ID NO: 50 is the determined cDNA sequence for 11094, showing homology to
Human Putative Enterocyte Differentiation Protein.

SEQ ID NO: 51 is the determined cDNA sequence for 11095, showing homology to
Human Transcriptional Corepressor hKAP1/TIF1B mRNA.

SEQ ID NO: 52 is the determined cDNA sequence for 11096, showing no
significant homology to any known gene.

SEQ ID NO: 53 is the determined cDNA sequence for 11097, showing homology to
Human Nonspecific Antigen.

SEQ ID NO: 54 is the determined cDNA sequence for 11098, showing no
significant homology to any known gene.

SEQ ID NO: 55 is the determined cDNA sequence for 11099, showing homology to
Human Pancreatic Secretory Inhibitor (PST) mRNA.

SEQ ID NO: 56 is the determined cDNA sequence for 11186, showing homology to
Human Pancreatic Secretory Inhibitor (PST) mRNA.

SEQ ID NO: 57 is the determined cDNA sequence for 11101, showing homology to
Human Chromosome X.

SEQ ID NO: 58 is the determined cDNA sequence for 11102, showing homology to
Human Chromosome X.

SEQ ID NO: 59 is the determined cDNA sequence for 11103, showing no
significant homology to any known gene.

SEQ ID NO: 60 is the determined cDNA sequence for 11174, showing no
significant homology to any known gene.

SEQ ID NO: 61 is the determined cDNA sequence for 11104, showing homology to
Human mRNA for KIAA0154.

SEQ ID NO: 62 is the determined cDNA sequence for 11105, showing homology
to Human Apurinic/Apyrimidinic Endonuclease (hap1) mRNA.

SEQ ID NO: 63 is the determined cDNA sequence for 11106, showing homology
to Human Chromosome 12p13.

SEQ ID NO: 64 is the determined cDNA sequence for 11107, showing homology to
Human 90 kDa Heat Shock Protein.

SEQ ID NO: 65 is the determined cDNA sequence for 11108, showing no
significant homology to any known gene.

SEQ ID NO: 66 is the determined cDNA sequence for 11112, showing no
significant homology to any known gene.

SEQ ID NO: 67 is the determined cDNA sequence for 11115, showing no
significant homology to any known gene.

SEQ ID NO: 68 is the determined cDNA sequence for 11117, showing no
significant homology to any known gene.

SEQ ID NO: 69 is the determined cDNA sequence for 11118, showing no
significant homology to any known gene.

SEQ ID NO: 70 is the determined cDNA sequence for 11119, showing homology to
Human Elongation Factor 1-alpha.

SEQ ID NO: 71 is the determined cDNA sequence for 11121, showing homology to
Human Lamin B Receptor (LBR) mRNA.

SEQ ID NO: 72 is the determined cDNA sequence for 11122, showing homology to
H. sapiens mRNA for Novel Glucocorticoid.

SEQ ID NO: 73 is the determined cDNA sequence for 11123, showing homology to
H. sapiens mRNA for snRNP protein B.

SEQ ID NO: 74 is the determined cDNA sequence for 11124, showing homology to
Human Cisplatin Resistance Associated Beta-protein.

SEQ ID NO: 75 is the determined cDNA sequence for 11127, showing homology to
M. musculus Calumenin mRNA.

SEQ ID NO: 76 is the determined cDNA sequence for 11128, showing homology to
Human ras-related small GTP binding protein.

SEQ ID NO: 77 is the determined cDNA sequence for 11130, showing homology to
Human Cosmid U169d2.

SEQ ID NO: 78 is the determined cDNA sequence for 11131, showing homology to
H. sapiens mRNA for protein homologous to Elongation 1-g.

SEQ ID NO: 79 is the determined cDNA sequence for 11134, showing no
significant homology to any known gene.

SEQ ID NO: 80 is the determined cDNA sequence for 11135, showing homology to
H. sapiens Niemann-Pick (NPC1) mRNA.

SEQ ID NO: 81 is the determined cDNA sequence for 11137, showing homology to
H. sapiens mRNA for Nicein b-chain.

SEQ ID NO: 82 is the determined cDNA sequence for 11138, showing homology to
Human Endogenous Retroviral Protease mRNA.

SEQ ID NO: 83 is the determined cDNA sequence for 11139, showing homology to
H. sapiens mRNA for DMBT1 protein.

SEQ ID NO: 84 is the determined cDNA sequence for 11140, showing homology to
H. sapiens ras GTPase activating-like protein.

SEQ ID NO: 85 is the determined cDNA sequence for 11143, showing homology to
Human Acidic Ribosomal Phosphoprotein P0 mRNA.






210121.471C11 



SEQ ID NO: 86 is the determined cDNA sequence for 11144, showing homology to
H. sapiens U21 mRNA.

SEQ ID NO: 87 is the determined cDNA sequence for 11145, showing homology to
Human GTP-binding protein.

SEQ ID NO: 88 is the determined cDNA sequence for 11148, showing homology to
H. sapiens U21 mRNA.

SEQ ID NO: 89 is the determined cDNA sequence for 11151, showing no
significant homology to any known gene.

SEQ ID NO: 90 is the determined cDNA sequence for 11154, showing no
significant homology to any known gene.

SEQ ID NO: 91 is the determined cDNA sequence for 11156, showing homology to
H. sapiens Ribosomal Protein L27.

SEQ ID NO: 92 is the determined cDNA sequence for 11157, showing homology to
H. sapiens Ribosomal Protein L27.

SEQ ID NO: 93 is the determined cDNA sequence for 11158, showing no
significant homology to any known gene.

SEQ ID NO: 94 is the determined cDNA sequence for 11162, showing homology to
Ag-X antigen.

SEQ ID NO: 95 is the determined cDNA sequence for 11164, showing homology to
H. sapiens mRNA for Signal Recognition Protein sub 14.

SEQ ID NO: 96 is the determined cDNA sequence for 11165, showing homology to
Human PAC 204e5/127h14.

SEQ ID NO: 97 is the determined cDNA sequence for 11166, showing homology to
Human mRNA for KIAA0108.

SEQ ID NO: 98 is the determined cDNA sequence for 11167, showing homology to
H. sapiens mRNA for Neutrophil Gelatinase Associated Lipocalin.

SEQ ID NO: 99 is the determined cDNA sequence for 11168, showing no
significant homology to any known gene.

SEQ ID NO: 100 is the determined cDNA sequence for 11172, showing no
significant homology to any known gene.

SEQ ID NO: 101 is the determined cDNA sequence for 11175, showing no
significant homology to any known gene.

SEQ ID NO: 102 is the determined cDNA sequence for 11176, showing homology
to Human maspin mRNA.

SEQ ID NO: 103 is the determined cDNA sequence for 11177, showing homology
to Human Carcinoembryonic Antigen.

SEQ ID NO: 104 is the determined cDNA sequence for 11178, showing homology
to Human A-Tubulin mRNA.

SEQ ID NO: 105 is the determined cDNA sequence for 11179, showing homology
to Human mRNA for proton-ATPase-like protein.

SEQ ID NO: 106 is the determined cDNA sequence for 11180, showing homology
to Human HepG2 3' region cDNA clone hmd.

SEQ ID NO: 107 is the determined cDNA sequence for 11182, showing homology
to Human MHC homologous to Chicken B-Complex Protein.

SEQ ID NO: 108 is the determined cDNA sequence for 11183, showing homology
to Human High Mobility Group Box (SSRP1) mRNA.

SEQ ID NO: 109 is the determined cDNA sequence for 11184, showing no
significant homology to any known gene.

SEQ ID NO: 110 is the determined cDNA sequence for 11185, showing no
significant homology to any known gene.

SEQ ID NO: 111 is the determined cDNA sequence for 11187, showing no
significant homology to any known gene.

SEQ ID NO: 112 is the determined cDNA sequence for 11190, showing homology
to Human Replication Protein A 70kDa.

SEQ ID NO: 113 is the determined cDNA sequence for Contig 47, also referred to
as C797P, showing homology to Human Chromosome X clone bWXD342.

SEQ ID NO: 114 is the determined cDNA sequence for Contig 7, showing
homology to Equilibrative Nucleoside Transporter 2 (ent2). 

SEQ ID NO: 115 is the determined cDNA sequence for 14235.1, also referred to as
C791P, showing homology to H. sapiens chromosome 21 derived BAC containing ets-2
gene.

SEQ ID NO: 116 is the determined cDNA sequence for 14287.2, showing no
significant homology to any known gene, but some degree of homology to Putative
Transmembrane Protein.

SEQ ID NO: 117 is the determined cDNA sequence for 14233.1, also referred to as
Contig 48, showing no significant homology to any known gene.

SEQ ID NO: 118 is the determined cDNA sequence for 14298.2, also referred to as
C793P, showing no significant homology to any known gene.

SEQ ID NO: 119 is the determined cDNA sequence for 14372, also referred to as
Contig 44, showing no significant homology to any known gene.

SEQ ID NO: 120 is the determined cDNA sequence for 14295, showing homology
to secreted cement gland protein XAG-2 homolog.

SEQ ID NO: 121 is the determined full-length cDNA sequence for a clone showing
homology to Beta IG-H3.

SEQ ID NO: 122 is the predicted amino acid sequence for the clone of SEQ ID NO:
121.

SEQ ID NO: 123 is a longer determined cDNA sequence for C751P.
SEQ ID NO: 124 is a longer determined cDNA sequence for C791P.
SEQ ID NO: 125 is a longer determined cDNA sequence for C792P.
SEQ ID NO: 126 is a longer determined cDNA sequence for C793P.
SEQ ID NO: 127 is a longer determined cDNA sequence for C794P.
SEQ ID NO: 128 is a longer determined cDNA sequence for C795P.
SEQ ID NO: 129 is a longer determined cDNA sequence for C796P.
SEQ ID NO: 130 is a longer determined cDNA sequence for C797P.
SEQ ID NO: 131 is a longer determined cDNA sequence for C798P.
SEQ ID NO: 132 is a longer determined cDNA sequence for C799P.

SEQ ID NO: 133 is a first partial determined cDNA sequence for CoSub-3 (also
known as 23569).

SEQ ID NO: 134 is a second partial determined cDNA sequence for CoSub-3 (also
known as 23569).

SEQ ID NO: 135 is a first partial determined cDNA sequence for CoSub-13 (also
known as 23579).

SEQ ID NO: 136 is a second partial determined cDNA sequence for CoSub-13
(also known as 23579).

SEQ ID NO: 137 is the determined cDNA sequence for CoSub-17 (also known as
23583).

SEQ ID NO: 138 is the determined cDNA sequence for CoSub-19 (also known as
23585).

SEQ ID NO: 139 is the determined cDNA sequence for CoSub-22 (also known as
23714).

SEQ ID NO: 140 is the determined cDNA sequence for CoSub-23 (also known as
23715).

SEQ ID NO: 141 is the determined cDNA sequence for CoSub-26 (also known as
23717).

SEQ ID NO: 142 is the determined cDNA sequence for CoSub-33 (also known as
23724).

SEQ ID NO: 143 is the determined cDNA sequence for CoSub-34 (also known as
23725).

SEQ ID NO: 144 is the determined cDNA sequence for CoSub-35 (also known as
23726).

SEQ ID NO: 145 is the determined cDNA sequence for CoSub-37 (also known as
23728).




SEQ ID NO: 146 is the determined cDNA sequence for CoSub-39 (also known as
23730).

SEQ ID NO: 147 is the determined cDNA sequence for CoSub-42 (also known as
23766).

SEQ ID NO: 148 is the determined cDNA sequence for CoSub-44 (also known as
23768).

SEQ ID NO: 149 is the determined cDNA sequence for CoSub-47 (also known as
23771).

SEQ ID NO: 150 is the determined cDNA sequence for CoSub-54 (also known as
23778).

SEQ ID NO: 151 is the determined cDNA sequence for CoSub-55 (also known as
23779).

SEQ ID NO: 152 is the determined cDNA sequence for CT1 (also known as
24099).

SEQ ID NO: 153 is the determined cDNA sequence for CT2 (also known as
24100).

SEQ ID NO: 154 is the determined cDNA sequence for CT3 (also known as
24101).

SEQ ID NO: 155 is the determined cDNA sequence for CT6 (also known as
24104).

SEQ ID NO: 156 is the determined cDNA sequence for CT7 (also known as
24105).

SEQ ID NO: 157 is the determined cDNA sequence for CT12 (also known as
24110).

SEQ ID NO: 158 is the determined cDNA sequence for CT13 (also known as
24111).

SEQ ID NO: 159 is the determined cDNA sequence for CT14 (also known as
24112).

SEQ ID NO: 160 is the determined cDNA sequence for CT15 (also known as
24113).

SEQ ID NO: 161 is the determined cDNA sequence for CT17 (also known as
24115).

SEQ ID NO: 162 is the determined cDNA sequence for CT18 (also known as
24116).

SEQ ID NO: 163 is the determined cDNA sequence for CT22 (also known as
23848).

SEQ ID NO: 164 is the determined cDNA sequence for CT24 (also known as
23849).

SEQ ID NO: 165 is the determined cDNA sequence for CT31 (also known as
23854).

SEQ ID NO: 166 is the determined cDNA sequence for CT34 (also known as
23856).

SEQ ID NO: 167 is the determined cDNA sequence for CT37 (also known as
23859).

SEQ ID NO: 168 is the determined cDNA sequence for CT39 (also known as
23860).

SEQ ID NO: 169 is the determined cDNA sequence for CT40 (also known as
23861).

SEQ ID NO: 170 is the determined cDNA sequence for CT51 (also known as
24130).

SEQ ID NO: 171 is the determined cDNA sequence for CT53 (also known as
24132).

SEQ ID NO: 172 is the determined cDNA sequence for CT63 (also known as
24595).

SEQ ID NO: 173 is the determined cDNA sequence for CT88 (also known as
24608).

SEQ ID NO: 174 is the determined cDNA sequence for CT92 (also known as
24800).

SEQ ID NO: 175 is the determined cDNA sequence for CT94 (also known as
24802).

SEQ ID NO: 176 is the determined cDNA sequence for CT102 (also known as
24805).

SEQ ID NO: 177 is the determined cDNA sequence for CT103 (also known as
24806).

SEQ ID NO: 178 is the determined cDNA sequence for CT111 (also known as
25520).

SEQ ID NO: 179 is the determined cDNA sequence for CT118 (also known as
25522).

SEQ ID NO: 180 is the determined cDNA sequence for CT121 (also known as
25523).

SEQ ID NO: 181 is the determined cDNA sequence for CT126 (also known as
25527).

SEQ ID NO: 182 is the determined cDNA sequence for CT135 (also known as
25534).

SEQ ID NO: 183 is the determined cDNA sequence for CT140 (also known as
25537).

SEQ ID NO: 184 is the determined cDNA sequence for CT145 (also known as
25542).

SEQ ID NO: 185 is the determined cDNA sequence for CT147 (also known as
25543).

SEQ ID NO: 186 is the determined cDNA sequence for CT148 (also known as
25544).

SEQ ID NO: 187 is the determined cDNA sequence for CT502 (also known as
26420).
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SEQ ID NO: 188 is the determined cDNA sequence for CT507 (also known as 

26425). 

SEQ ID NO: 189 is the determined cDNA sequence for CT521 (also known as 

27366). 

5 SEQ ID NO: 190 is the determined cDNA sequence for CT544 (also known as 

27375). 

SEQ ID NO: 191 is the determined cDNA sequence for CT577 (also known as 

27385). 

SEQ ID NO: 192 is the determined cDNA sequence for CT580 (also known as 

10 27387). 

SEQ ID NO: 193 is the determined cDNA sequence for CT594 (also known as 

27540). 

SEQ ID NO: 194 is the determined cDNA sequence for CT606 (also known as 

27547) . 

15 SEQ ID NO: 195 is the determined cDNA sequence for CT607 (also known as 

27548) . 

SEQ ID NO: 196 is the determined cDNA sequence for CT599 (also known as 

27903). 

SEQ ID NO: 197 is the determined cDNA sequence for CT632 (also known as 

20 27922). 

SEQ ID NO: 198 is the predicted amino acid sequence for CT502 (SEQ ID NO: 

187) . 

SEQ ID NO: 199 is the predicted amino acid sequence for CT507 (SEQ ID NO: 

188) . 

25 SEQ ID NO: 200 is the predicted amino acid sequence for CT521 (SEQ ID NO: 

189) . 

SEQ ID NO: 201 is the predicted amino acid sequence for CT544 (SEQ ID NO: 

190) . 
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SEQ ID NO: 202 is the predicted amino acid sequence for CT606 (SEQ ID NO: 

194) . 

SEQ ID NO: 203 is the predicted amino acid sequence for CT607 (SEQ ID NO: 

195) . 

5 SEQ ID NO: 204 is the predicted amino acid sequence for CT632 (SEQ ID NO: 

197). 
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H ptprm i n pH 
UClCllIUllCU 


rDNA 

CIV IN /A. 


cpmipnrp 
act[ uciicc 


fnr rlnnp 

XUI C1U11C 








SEQ ID NO: 297 is the determined cDNA sequence for clone 25857.

SEQ ID NO: 298 is the determined cDNA sequence for clone 25858.

SEQ ID NO: 299 is the determined cDNA sequence for clone 25859.

SEQ ID NO: 300 is the determined cDNA sequence for clone 25860.

SEQ ID NO: 301 is the determined cDNA sequence for clone 25861.

SEQ ID NO: 302 is the determined cDNA sequence for clone 25862.

SEQ ID NO: 303 is the determined cDNA sequence for clone 25863.

SEQ ID NO: 304 is the determined cDNA sequence for clone 25864.

SEQ ID NO: 305 is the determined cDNA sequence for clone 25865.

SEQ ID NO: 306 is the determined cDNA sequence for clone 25866.

SEQ ID NO: 307 is the determined cDNA sequence for clone 25867.

SEQ ID NO: 308 is the determined cDNA sequence for clone 25868.

SEQ ID NO: 309 is the determined cDNA sequence for clone 25869.

SEQ ID NO: 310 is the determined cDNA sequence for clone 25870.






SEQ ID NO: 311 is the determined cDNA sequence for clone 25871.

SEQ ID NO: 312 is the determined cDNA sequence for clone 25872.

SEQ ID NO: 313 is the determined cDNA sequence for clone 25873.

SEQ ID NO: 314 is the determined cDNA sequence for clone 25875.



210121.471C11 







SEQ ID NO: 315 is the determined cDNA sequence for clone 25876.

SEQ ID NO: 316 is the determined cDNA sequence for clone 25877.

SEQ ID NO: 317 is the determined cDNA sequence for clone 25878.

SEQ ID NO: 318 is the determined cDNA sequence for clone 25879.

SEQ ID NO: 319 is the determined cDNA sequence for clone 25880.

SEQ ID NO: 320 is the determined cDNA sequence for clone 25881.

SEQ ID NO: 321 is the determined cDNA sequence for clone 25882.

SEQ ID NO: 322 is the determined cDNA sequence for clone 25883.

SEQ ID NO: 323 is the determined cDNA sequence for clone 25884.

SEQ ID NO: 324 is the determined cDNA sequence for clone 25885.

SEQ ID NO: 325 is the determined cDNA sequence for clone 25886.

SEQ ID NO: 326 is the determined cDNA sequence for clone 25887.

SEQ ID NO: 327 is the determined cDNA sequence for clone 25888.

SEQ ID NO: 328 is the determined cDNA sequence for clone 25889.

SEQ ID NO: 329 is the determined cDNA sequence for clone 25890.

SEQ ID NO: 330 is the determined cDNA sequence for clone 25892.

SEQ ID NO: 331 is the determined cDNA sequence for clone 25894.

SEQ ID NO: 332 is the determined cDNA sequence for clone [illegible].

SEQ ID NO: 333 is the determined cDNA sequence for clone [illegible].

SEQ ID NO: 334 is the determined cDNA sequence for clone 25897.

SEQ ID NO: 335 is the determined cDNA sequence for clone [illegible].

SEQ ID NO: 336 is the determined cDNA sequence for clone 25900.

SEQ ID NO: 337 is the determined cDNA sequence for clone 25901.

SEQ ID NO: 338 is the determined cDNA sequence for clone 25902.

SEQ ID NO: 339 is the determined cDNA sequence for clone 25903.






SEQ ID NO: 340 is the determined cDNA sequence for clone 25904.

SEQ ID NO: 341 is the determined cDNA sequence for clone 25906.

SEQ ID NO: 342 is the determined cDNA sequence for clone 25907.

SEQ ID NO: 343 is the determined cDNA sequence for clone 25908.










SEQ ID NO: 344 is the determined cDNA sequence for clone 25909.

SEQ ID NO: 345 is the determined cDNA sequence for clone 25910.

SEQ ID NO: 346 is the determined cDNA sequence for clone 25911.

SEQ ID NO: 347 is the determined cDNA sequence for clone 25912.

SEQ ID NO: 348 is the determined cDNA sequence for clone 25913.

SEQ ID NO: 349 is the determined cDNA sequence for clone 25914.

SEQ ID NO: 350 is the determined cDNA sequence for clone 25915.

SEQ ID NO: 351 is the determined cDNA sequence for clone 25916.

SEQ ID NO: 352 is the determined cDNA sequence for clone 25917.

SEQ ID NO: 353 is the determined cDNA sequence for clone 25918.

SEQ ID NO: 354 is the determined cDNA sequence for clone [illegible].

SEQ ID NO: 355 is the determined cDNA sequence for clone 25920.

SEQ ID NO: 356 is the determined cDNA sequence for clone 25921.

SEQ ID NO: 357 is the determined cDNA sequence for clone 25922.

SEQ ID NO: 358 is the determined cDNA sequence for clone 25924.

SEQ ID NO: 359 is the determined cDNA sequence for clone 25925.

SEQ ID NO: 360 is the determined cDNA sequence for clone 25926.

SEQ ID NO: 361 is the determined cDNA sequence for clone 25927.

SEQ ID NO: 362 is the determined cDNA sequence for clone 25928.

SEQ ID NO: 363 is the determined cDNA sequence for clone 25929.

SEQ ID NO: 364 is the determined cDNA sequence for clone 25930.

SEQ ID NO: 365 is the determined cDNA sequence for clone 25931.

SEQ ID NO: 366 is the determined cDNA sequence for clone 25932.

SEQ ID NO: 367 is the determined cDNA sequence for clone 25933.

SEQ ID NO: 368 is the determined cDNA sequence for clone 25934.






SEQ ID NO: 369 is the determined cDNA sequence for clone 25935.

SEQ ID NO: 370 is the determined cDNA sequence for clone 25936.

SEQ ID NO: 371 is the determined cDNA sequence for clone 25939.

SEQ ID NO: 372 is the determined cDNA sequence for clone 32016.










SEQ ID NO: 373 is the determined cDNA sequence for clone 32021.

SEQ ID NO: 374 is the determined cDNA sequence for clone 31993.

SEQ ID NO: 375 is the determined cDNA sequence for clone 31997.

SEQ ID NO: 376 is the determined cDNA sequence for clone 31942.

SEQ ID NO: 377 is the determined cDNA sequence for clone 31937.

SEQ ID NO: 378 is the determined cDNA sequence for clone 31952.

SEQ ID NO: 379 is the determined cDNA sequence for clone 31992.

SEQ ID NO: 380 is the determined cDNA sequence for clone 31961.

SEQ ID NO: 381 is the determined cDNA sequence for clone 31964.

SEQ ID NO: 382 is the determined cDNA sequence for clone 32005.

SEQ ID NO: 383 is the determined cDNA sequence for clone 31980.

SEQ ID NO: 384 is the determined cDNA sequence for clone [illegible].

SEQ ID NO: 385 is the determined cDNA sequence for clone 32004.

SEQ ID NO: 386 is the determined cDNA sequence for clone 31956.

SEQ ID NO: 387 is the determined cDNA sequence for clone 31934.

SEQ ID NO: 388 is the determined cDNA sequence for clone 31998.

SEQ ID NO: 389 is the determined cDNA sequence for clone 31973.

SEQ ID NO: 390 is the determined cDNA sequence for clone 31976.

SEQ ID NO: 391 is the determined cDNA sequence for clone 31988.

SEQ ID NO: 392 is the determined cDNA sequence for clone 31948.

SEQ ID NO: 393 is the determined cDNA sequence for clone 32013.

SEQ ID NO: 394 is the determined cDNA sequence for clone 31986.

SEQ ID NO: 395 is the determined cDNA sequence for clone 31954.

SEQ ID NO: 396 is the determined cDNA sequence for clone 31987.

SEQ ID NO: 397 is the determined cDNA sequence for clone [illegible].






SEQ ID NO: 398 is the determined cDNA sequence for clone 32028.

SEQ ID NO: 399 is the determined cDNA sequence for clone 32012.

SEQ ID NO: 400 is the determined cDNA sequence for clone 31959.

SEQ ID NO: 401 is the determined cDNA sequence for clone 32027.
SEQ ID NO: 402 is the determined cDNA sequence for clone 31957.

SEQ ID NO: 403 is the determined cDNA sequence for clone 31950.

SEQ ID NO: 404 is the determined cDNA sequence for clone 32011.

SEQ ID NO: 405 is the determined cDNA sequence for clone 32022.

SEQ ID NO: 406 is the determined cDNA sequence for clone 32014.

SEQ ID NO: 407 is the determined cDNA sequence for clone 31963.

SEQ ID NO: 408 is the determined cDNA sequence for clone 31989.

SEQ ID NO: 409 is the determined cDNA sequence for clone 32015.

SEQ ID NO: 410 is the determined cDNA sequence for clone 32002.

SEQ ID NO: 411 is the determined cDNA sequence for clone 31939.

SEQ ID NO: 412 is the determined cDNA sequence for clone 32003.

SEQ ID NO: 413 is the determined cDNA sequence for clone 31936.

SEQ ID NO: 414 is the determined cDNA sequence for clone 32007.

SEQ ID NO: 415 is the determined cDNA sequence for clone 31965.

SEQ ID NO: 416 is the determined cDNA sequence for clone 31935.

SEQ ID NO: 417 is the determined cDNA sequence for clone 32008.

SEQ ID NO: 418 is the determined cDNA sequence for clone 31966.

SEQ ID NO: 419 is the determined cDNA sequence for clone 32020.

SEQ ID NO: 420 is the determined cDNA sequence for clone 31971.

SEQ ID NO: 421 is the determined cDNA sequence for clone 31977.

SEQ ID NO: 422 is the determined cDNA sequence for clone 31985.

SEQ ID NO: 423 is the determined cDNA sequence for clone 32023.

SEQ ID NO: 424 is the determined cDNA sequence for clone 31981.

SEQ ID NO: 425 is the determined cDNA sequence for clone 32006.

SEQ ID NO: 426 is the determined cDNA sequence for clone 31991.


SEQ ID NO: 427 is the determined cDNA sequence for clone 31995.

SEQ ID NO: 428 is the determined cDNA sequence for clone 32000.

SEQ ID NO: 429 is the determined cDNA sequence for clone 31990.

SEQ ID NO: 430 is the determined cDNA sequence for clone 31946.
SEQ ID NO: 431 is the determined cDNA sequence for clone [illegible].

SEQ ID NO: 432 is the determined cDNA sequence for clone 31941.

SEQ ID NO: 433 is the determined cDNA sequence for clone [illegible].

SEQ ID NO: 434 is the determined cDNA sequence for clone 31996.

SEQ ID NO: 435 is the determined cDNA sequence for clone 32010.

SEQ ID NO: 436 is the determined cDNA sequence for clone 31974.

SEQ ID NO: 437 is the determined cDNA sequence for clone [illegible].

SEQ ID NO: 438 is the determined cDNA sequence for clone 31999.

SEQ ID NO: 439 is the determined cDNA sequence for clone 31949.

SEQ ID NO: 440 is the determined cDNA sequence for clone 31947.

SEQ ID NO: 441 is the determined cDNA sequence for clone 31994.

SEQ ID NO: 442 is the determined cDNA sequence for clone 31958.

SEQ ID NO: 443 is the determined cDNA sequence for clone 31975.

SEQ ID NO: 444 is the determined cDNA sequence for clone 31984.

SEQ ID NO: 445 is the determined cDNA sequence for clone 32024.

SEQ ID NO: 446 is the determined cDNA sequence for clone [illegible].

SEQ ID NO: 447 is the determined cDNA sequence for clone 31943.

SEQ ID NO: 448 is the determined cDNA sequence for clone 32018.

SEQ ID NO: 449 is the determined cDNA sequence for clone [illegible].

SEQ ID NO: 450 is the determined cDNA sequence for clone 32009.

SEQ ID NO: 451 is the determined cDNA sequence for clone 32019.

SEQ ID NO: 452 is the determined cDNA sequence for clone 32025.

SEQ ID NO: 453 is the determined cDNA sequence for clone 31967.

SEQ ID NO: 454 is the determined cDNA sequence for clone 31968.

SEQ ID NO: 455 is the determined cDNA sequence for clone [illegible].








SEQ ID NO: 456 is the determined cDNA sequence for clone 31951.

SEQ ID NO: 457 is the determined cDNA sequence for clone 31970.

SEQ ID NO: 458 is the determined cDNA sequence for clone 31962.

SEQ ID NO: 459 is the determined cDNA sequence for clone 32001.
SEQ ID NO: 460 is the determined cDNA sequence for clone 31953.

SEQ ID NO: 461 is the determined cDNA sequence for clone 31944.

SEQ ID NO: 462 is the determined cDNA sequence for clone 31825.

SEQ ID NO: 463 is the determined cDNA sequence for clone 31828.

SEQ ID NO: 464 is the determined cDNA sequence for clone 31830.

SEQ ID NO: 465 is the determined cDNA sequence for clone 31841.

SEQ ID NO: 466 is the determined cDNA sequence for clone 31847.

SEQ ID NO: 467 is the determined cDNA sequence for clone 31850.

SEQ ID NO: 468 is the determined cDNA sequence for clone 31852.

SEQ ID NO: 469 is the determined cDNA sequence for clone 31855.

SEQ ID NO: 470 is the determined cDNA sequence for clone 31858.

SEQ ID NO: 471 is the determined cDNA sequence for clone 31861.

SEQ ID NO: 472 is the determined cDNA sequence for clone 31868.

SEQ ID NO: 473 is the determined cDNA sequence for clone 31870.

SEQ ID NO: 474 is the determined cDNA sequence for clone 31872.

SEQ ID NO: 475 is the determined cDNA sequence for clone 31873.

SEQ ID NO: 476 is the determined cDNA sequence for clone 31877.

SEQ ID NO: 477 is the determined cDNA sequence for clone 31878.

SEQ ID NO: 478 is the determined cDNA sequence for clone 31885.

SEQ ID NO: 479 is the determined cDNA sequence for clone 31888.

SEQ ID NO: 480 is the determined cDNA sequence for clone 31890.

SEQ ID NO: 481 is the determined cDNA sequence for clone 31893.

SEQ ID NO: 482 is the determined cDNA sequence for clone 31898.

SEQ ID NO: 483 is the determined cDNA sequence for clone 31901.

SEQ ID NO: 484 is the determined cDNA sequence for clone 31909.








SEQ ID NO: 485 is the determined cDNA sequence for clone 31910.






SEQ ID NO: 486 is the determined cDNA sequence for clone 31914.
SEQ ID NO: 487 is the determined cDNA sequence for contig 1.
SEQ ID NO: 488 is the determined cDNA sequence for contig 2.



SEQ ID NO: 489 is the determined cDNA sequence for contig 3.
SEQ ID NO: 490 is the determined cDNA sequence for contig 4.
SEQ ID NO: 491 is the determined cDNA sequence for contig 5.
SEQ ID NO: 492 is the determined cDNA sequence for contig 6.
SEQ ID NO: 493 is the determined cDNA sequence for contig 7.
SEQ ID NO: 494 is the determined cDNA sequence for contig 8.
SEQ ID NO: 495 is the determined cDNA sequence for contig 9.
SEQ ID NO: 496 is the determined cDNA sequence for contig 10.
SEQ ID NO: 497 is the determined cDNA sequence for contig 11.
SEQ ID NO: 498 is the determined cDNA sequence for contig 12.
SEQ ID NO: 499 is the determined cDNA sequence for contig 13.


SEQ ID NO: 500 is the determined cDNA sequence for contig 14.
SEQ ID NO: 501 is the determined cDNA sequence for contig 15.
SEQ ID NO: 502 is the determined cDNA sequence for contig 16.
SEQ ID NO: 503 is the determined cDNA sequence for contig 17.
SEQ ID NO: 504 is the determined cDNA sequence for contig 18.
SEQ ID NO: 505 is the determined cDNA sequence for contig 19.
SEQ ID NO: 506 is the determined cDNA sequence for contig 20.
SEQ ID NO: 507 is the determined cDNA sequence for contig 21.
SEQ ID NO: 508 is the determined cDNA sequence for contig 22.
SEQ ID NO: 509 is the determined cDNA sequence for contig 23.
SEQ ID NO: 510 is the determined cDNA sequence for contig 24.
SEQ ID NO: 511 is the determined cDNA sequence for contig 25.
SEQ ID NO: 512 is the determined cDNA sequence for contig 26.
SEQ ID NO: 513 is the determined cDNA sequence for contig 27.






SEQ ID NO: 514 is the determined cDNA sequence for contig 28.






SEQ ID NO: 515 is the determined cDNA sequence for contig 29.
SEQ ID NO: 516 is the determined cDNA sequence for contig 30.
SEQ ID NO: 517 is the determined cDNA sequence for contig 31.
SEQ ID NO: 518 is the determined cDNA sequence for contig 32.
SEQ ID NO: 519 is the determined cDNA sequence for contig 33.
SEQ ID NO: 520 is the determined cDNA sequence for contig 34.
SEQ ID NO: 521 is the determined cDNA sequence for contig 35.
SEQ ID NO: 522 is the determined cDNA sequence for contig 36.
SEQ ID NO: 523 is the determined cDNA sequence for contig 37.
SEQ ID NO: 524 is the determined cDNA sequence for contig 38.
SEQ ID NO: 525 is the determined cDNA sequence for contig 39.
SEQ ID NO: 526 is the determined cDNA sequence for contig 40.
SEQ ID NO: 527 is the determined cDNA sequence for contig 41.
SEQ ID NO: 528 is the determined cDNA sequence for contig 42.
SEQ ID NO: 529 is the determined cDNA sequence for contig 43.


SEQ ID NO: 530 is the determined cDNA sequence for contig 44.
SEQ ID NO: 531 is the determined cDNA sequence for contig 45.
SEQ ID NO: 532 is the determined cDNA sequence for contig 46.
SEQ ID NO: 533 is the determined cDNA sequence for contig 47.
SEQ ID NO: 534 is the determined cDNA sequence for contig 48.
SEQ ID NO: 535 is the determined cDNA sequence for contig 49.
SEQ ID NO: 536 is the determined cDNA sequence for contig 50.
SEQ ID NO: 537 is the determined cDNA sequence for contig 51.
SEQ ID NO: 538 is the determined cDNA sequence for contig 52.






SEQ ID NO: 539 is the determined cDNA sequence for contig 53.
SEQ ID NO: 540 is the determined cDNA sequence for contig 54.
SEQ ID NO: 541 is the determined cDNA sequence for contig 55.






SEQ ID NO: 542 is the determined cDNA sequence for contig 56.








SEQ ID NO: 543 is the determined cDNA sequence for contig 58.






SEQ ID NO: 544 is the determined cDNA sequence for contig 59.
SEQ ID NO: 545 is the determined cDNA sequence for contig 60.
SEQ ID NO: 546 is the determined cDNA sequence for contig 61.
SEQ ID NO: 547 is the determined cDNA sequence for contig 62.
SEQ ID NO: 548 is the determined cDNA sequence for contig 63.
SEQ ID NO: 549 is the determined cDNA sequence for contig 64.
SEQ ID NO: 550 is the determined cDNA sequence for contig 65.
SEQ ID NO: 551 is the determined cDNA sequence for contig 66.
SEQ ID NO: 552 is the determined cDNA sequence for contig 67.
SEQ ID NO: 553 is the determined cDNA sequence for contig 68.
SEQ ID NO: 554 is the determined cDNA sequence for contig 69.
SEQ ID NO: 555 is the determined cDNA sequence for contig 70.
SEQ ID NO: 556 is the determined cDNA sequence for contig 71.
SEQ ID NO: 557 is the determined cDNA sequence for contig 72.









SEQ ID NO: 558 is the determined cDNA sequence for contig 73.
SEQ ID NO: 559 is the determined cDNA sequence for contig 74.








SEQ ID NO: 560 is the determined cDNA sequence for contig 75.
SEQ ID NO: 561 is the determined cDNA sequence for contig 76.
SEQ ID NO: 562 is the determined cDNA sequence for contig 77.
SEQ ID NO: 563 is the determined cDNA sequence for contig 78.
SEQ ID NO: 564 is the determined cDNA sequence for contig 79.
SEQ ID NO: 565 is the determined cDNA sequence for contig 80.






SEQ ID NO: 566 is the determined cDNA sequence for contig 81.
SEQ ID NO: 567 is the determined cDNA sequence for contig 82.
SEQ ID NO: 568 is the determined cDNA sequence for contig 83.








SEQ ID NO: 569 is the determined cDNA sequence for clone CS1-101.
SEQ ID NO: 570 is the determined cDNA sequence for clone CS1-102.
SEQ ID NO: 571 is the determined cDNA sequence for clone CS1-104.






SEQ ID NO: 572 is the determined cDNA sequence for clone CS1-105.
SEQ ID NO: 573 is the determined 3' cDNA sequence for clone CS1-106.
SEQ ID NO: 574 is the determined 5' cDNA sequence for clone CS1-106.
SEQ ID NO: 575 is the determined cDNA sequence for clone CS1-114.



SEQ ID NO: 576 is the determined cDNA sequence for clone CS1-118.
SEQ ID NO: 577 is the determined cDNA sequence for clone CS1-120.
SEQ ID NO: 578 is the determined cDNA sequence for clone CS1-123.
SEQ ID NO: 579 is the determined 3' cDNA sequence for clone CS1-124.
SEQ ID NO: 580 is the determined 5' cDNA sequence for clone CS1-124.
SEQ ID NO: 581 is the determined cDNA sequence for clone CS1-128.
SEQ ID NO: 582 is the determined cDNA sequence for clone CS1-132.
SEQ ID NO: 583 is the determined cDNA sequence for clone CS1-136.
SEQ ID NO: 584 is the determined cDNA sequence for clone CS1-137.
SEQ ID NO: 585 is the determined cDNA sequence for clone CS1-139.
SEQ ID NO: 586 is the determined cDNA sequence for clone CS1-141.
SEQ ID NO: 587 is the determined cDNA sequence for clone CS1-152.
SEQ ID NO: 588 is the determined cDNA sequence for clone CS1-154.
SEQ ID NO: 589 is the determined cDNA sequence for clone CS1-156.
SEQ ID NO: 590 is the determined cDNA sequence for clone CS1-158.
SEQ ID NO: 591 is the determined cDNA sequence for clone CS1-160.
SEQ ID NO: 592 is the determined cDNA sequence for clone CS1-168.
SEQ ID NO: 593 is the determined cDNA sequence for clone CS1-169.
SEQ ID NO: 594 is the determined cDNA sequence for clone CS1-171.
SEQ ID NO: 595 is the determined cDNA sequence for clone CS1-176.
SEQ ID NO: 596 is the determined cDNA sequence for clone CS1-178.
SEQ ID NO: 597 is the determined cDNA sequence for clone CS1-180.
SEQ ID NO: 598 is the determined cDNA sequence for clone CS1-183.
SEQ ID NO: 599 is the determined cDNA sequence for clone CS1-184.
SEQ ID NO: 600 is the determined cDNA sequence for clone CS1-187.
SEQ ID NO: 601 is the determined cDNA sequence for clone CS1-190.
SEQ ID NO: 602 is the determined cDNA sequence for clone CS1-194.
SEQ ID NO: 603 is the determined cDNA sequence for clone CS1-195.
SEQ ID NO: 604 is the determined cDNA sequence for clone CS1-196.



SEQ ID NO: 605 is the determined cDNA sequence for clone CS1-197.
SEQ ID NO: 606 is the determined cDNA sequence for clone CS1-200.
SEQ ID NO: 607 is the determined cDNA sequence for clone CS1-206.
SEQ ID NO: 608 is the determined cDNA sequence for clone CS1-207.
SEQ ID NO: 609 is the determined cDNA sequence for clone CS1-234.
SEQ ID NO: 610 is the determined cDNA sequence for clone CS1-238.
SEQ ID NO: 611 is the determined cDNA sequence for clone CS1-239.
SEQ ID NO: 612 is the determined cDNA sequence for clone CS1-243.
SEQ ID NO: 613 is the determined cDNA sequence for clone CS1-246.
SEQ ID NO: 614 is the determined cDNA sequence for clone CS1-249.
SEQ ID NO: 615 is the determined cDNA sequence for clone CS1-250.
SEQ ID NO: 616 is the determined cDNA sequence for clone CS1-252.
SEQ ID NO: 617 is the determined cDNA sequence for clone CT502.
SEQ ID NO: 618 is the determined cDNA sequence for clone CT507.
SEQ ID NO: 619 is the determined cDNA sequence for clone CT521.
SEQ ID NO: 620 is the determined cDNA sequence for clone CT544.
SEQ ID NO: 621 is the determined cDNA sequence for clone CT577.
SEQ ID NO: 622 is the determined cDNA sequence for clone CT580.
SEQ ID NO: 623 is the determined cDNA sequence for clone CT594.
SEQ ID NO: 624 is the determined cDNA sequence for clone CT606.
SEQ ID NO: 625 is the determined cDNA sequence for clone CT607.
SEQ ID NO: 626 is the determined cDNA sequence for clone CT599.
SEQ ID NO: 627 is the determined cDNA sequence for clone CT632.
SEQ ID NO: 628 is the determined cDNA sequence for clone 35691.
SEQ ID NO: 629 is the determined cDNA sequence for clone 35707.
SEQ ID NO: 630 is the determined cDNA sequence for clone CSE-2.
SEQ ID NO: 631 is the amino acid sequence for clone CSE-2.
SEQ ID NO: 632 is the determined cDNA sequence for clone CT2-1.
SEQ ID NO: 633 is the determined cDNA sequence for clone CT2-6.
SEQ ID NO: 634 is the determined cDNA sequence for clone CT2-8.
SEQ ID NO: 635 is the determined cDNA sequence for clone CT2-9.
SEQ ID NO: 636 is the determined cDNA sequence for clone CT2-12.
SEQ ID NO: 637 is the determined cDNA sequence for clone CT2-15.
SEQ ID NO: 638 is the determined cDNA sequence for clone CT2-16.
SEQ ID NO: 639 is the determined cDNA sequence for clone CT2-17.
SEQ ID NO: 640 is the determined cDNA sequence for clone CT2-19.
SEQ ID NO: 641 is the determined cDNA sequence for clone CT2-23.
SEQ ID NO: 642 is the determined cDNA sequence for clone CT2-25.
SEQ ID NO: 643 is the determined cDNA sequence for clone CT2-27.






SEQ ID NO: 644 is the determined cDNA sequence for clone


v_x 1 v v . 






SEQ ID NO: 645 is the determined cDNA sequence for clone CT2-39.
SEQ ID NO: 646 is the determined cDNA sequence for clone CT2-41.






SEQ ID NO: 647 is the determined cDNA sequence for clone


V_v X *TJ . 


SEQ ID NO: 648 is the determined cDNA sequence for clone CT2-44.
SEQ ID NO: 649 is the determined cDNA sequence for clone CT2-51.


SEQ ID NO: 650 is the determined cDNA sequence for clone


V_/ 1 L"jT, 






SEQ ID NO: 651 is the determined cDNA sequence for clone CT2-55.
SEQ ID NO: 652 is the determined cDNA sequence for clone CT2-57.
SEQ ID NO: 653 is the determined cDNA sequence for clone CT2-60.
SEQ ID NO: 654 is the determined cDNA sequence for clone CT2-64.
SEQ ID NO: 655 is the determined cDNA sequence for clone CT2-67.
SEQ ID NO: 656 is the determined cDNA sequence for clone CT2-68.
SEQ ID NO: 657 is the determined cDNA sequence for clone CT2-75.
SEQ ID NO: 658 is the determined cDNA sequence for clone CT2-79.








SEQ ID NO: 659 is the determined cDNA sequence for clone CT2-109.






SEQ ID NO: 660 is the determined cDNA sequence for clone CT2-112.
SEQ ID NO: 661 is the determined cDNA sequence for clone CT2-127.
SEQ ID NO: 662 is the determined cDNA sequence for clone CT2-129.
SEQ ID NO: 663 is the determined cDNA sequence for clone CT2-156.
SEQ ID NO: 664 is the determined cDNA sequence for clone CT2-162.
SEQ ID NO: 665 is the determined cDNA sequence for clone CT2-167.
SEQ ID NO: 666 is the determined cDNA sequence for clone CT2-169.
SEQ ID NO: 667 is the determined cDNA sequence for clone CT2-172.
SEQ ID NO: 668 is the determined cDNA sequence for clone CT2-173.
SEQ ID NO: 669 is the determined cDNA sequence for clone CT2-174.
SEQ ID NO: 670 is the determined cDNA sequence for clone CT2-177.
SEQ ID NO: 671 is the determined cDNA sequence for clone CT2-181.
SEQ ID NO: 672 is the determined cDNA sequence for clone CT2-191.
SEQ ID NO: 673 is the determined cDNA sequence for clone CT2-192.
SEQ ID NO: 674 is the determined cDNA sequence for clone CT2-207.
SEQ ID NO: 675 is the determined cDNA sequence for clone CT2-222.
SEQ ID NO: 676 is the determined cDNA sequence for clone CT2-223.
SEQ ID NO: 677 is the determined cDNA sequence for clone CT2-233.
SEQ ID NO: 678 is the determined cDNA sequence for clone CT2-244.
SEQ ID NO: 679 is the determined cDNA sequence for clone CT2-257.




SEQ ID NO: 680 is the determined cDNA sequence for clone CT2-279.


SEQ ID NO: 681 is the determined cDNA sequence for clone CT2-288.
SEQ ID NO: 682 is the determined cDNA sequence for clone CT2-291.


SEQ ID NO:683 is the full-length cDNA sequence for human PAC (SEQ ID NO: 18 and 19).

SEQ ID NO:684 is the full-length cDNA sequence for murine homologue of human 
PAC (SEQ ID NO: 683). 

SEQ ID NO: 685 is the predicted amino acid sequence for the clone of SEQ ID 
NO:683. 

SEQ ID NO:686 is a longer determined cDNA sequence for clone CoSub-19 (SEQ 
ID NO: 138). 
SEQ ID NO:687 is the predicted amino acid sequence for the clone of SEQ ID
NO:686.

SEQ ID NO:688 is the nucleotide sequence of the M13 forward primer.

SEQ ID NO:689 is the nucleotide sequence of the M13 reverse primer.

SEQ ID NO:690 is a longer determined cDNA sequence for C799P (SEQ ID
NO:40), showing homology to Homo sapiens NADH/NADPH thyroid oxidase p138-tox
mRNA.

SEQ ID NO:691 is a longer determined cDNA sequence for C794P (SEQ ID 
NO:41). 

SEQ ID NO:692 is the predicted amino acid sequence for the clone of SEQ ID 
NO:690. 

SEQ ID NO:693 is the predicted amino acid sequence for the clone of SEQ ID
NO:691.



SEQ ID NO: 694 is the determined cDNA sequence for clone R0093:A03.
SEQ ID NO: 695 is the determined cDNA sequence for clone R0093:A10.
SEQ ID NO: 696 is the determined cDNA sequence for clone R0093:A11.
SEQ ID NO: 697 is the determined cDNA sequence for clone R0093:A12.
SEQ ID NO: 698 is the determined cDNA sequence for clone R0093:B03.
SEQ ID NO: 699 is the determined cDNA sequence for clone R0093:B04.
SEQ ID NO: 700 is the determined cDNA sequence for clone R0093:B09.
SEQ ID NO: 701 is the determined cDNA sequence for clone R0093:B10.
SEQ ID NO: 702 is the determined cDNA sequence for clone R0093:B11.
SEQ ID NO: 703 is the determined cDNA sequence for clone R0093:B12.
SEQ ID NO: 704 is the determined cDNA sequence for clone R0093:C01.
SEQ ID NO: 705 is the determined cDNA sequence for clone R0093:C03.
SEQ ID NO: 706 is the determined cDNA sequence for clone R0093:C04.
SEQ ID NO: 707 is the determined cDNA sequence for clone R0093:C06.
SEQ ID NO: 708 is the determined cDNA sequence for clone R0093:C08.
SEQ ID NO: 709 is the determined cDNA sequence for clone R0093:C09.
SEQ ID NO: 710 is the determined cDNA sequence for clone R0093:C10.
SEQ ID NO: 711 is the determined cDNA sequence for clone R0093:C11.
SEQ ID NO: 712 is the determined cDNA sequence for clone R0093:C12.
SEQ ID NO: 713 is the determined cDNA sequence for clone R0093:D01.
SEQ ID NO: 714 is the determined cDNA sequence for clone R0093:D02.
SEQ ID NO: 715 is the determined cDNA sequence for clone R0093:D03.
SEQ ID NO: 716 is the determined cDNA sequence for clone R0093:D04.
SEQ ID NO: 717 is the determined cDNA sequence for clone R0093:D05.
SEQ ID NO: 718 is the determined cDNA sequence for clone R0093:D06.
SEQ ID NO: 719 is the determined cDNA sequence for clone R0093:D07.
SEQ ID NO: 720 is the determined cDNA sequence for clone R0093:D08.
SEQ ID NO: 721 is the determined cDNA sequence for clone R0093:D10.
SEQ ID NO: 722 is the determined cDNA sequence for clone R0093:D11.
SEQ ID NO: 723 is the determined cDNA sequence for clone R0093:E02.
SEQ ID NO: 724 is the determined cDNA sequence for clone R0093:E03.
SEQ ID NO: 725 is the determined cDNA sequence for clone R0093:E04.
SEQ ID NO: 726 is the determined cDNA sequence for clone R0093:E06.
SEQ ID NO: 727 is the determined cDNA sequence for clone R0093:E07.
SEQ ID NO: 728 is the determined cDNA sequence for clone R0093:E08.
SEQ ID NO: 729 is the determined cDNA sequence for clone R0093:E09.
SEQ ID NO: 730 is the determined cDNA sequence for clone R0093:E10.
SEQ ID NO: 731 is the determined cDNA sequence for clone R0093:E11.
SEQ ID NO: 732 is the determined cDNA sequence for clone R0093:F02.
SEQ ID NO: 733 is the determined cDNA sequence for clone R0093:F03.
SEQ ID NO: 734 is the determined cDNA sequence for clone R0093:F04.
SEQ ID NO: 735 is the determined cDNA sequence for clone R0093:F05.
SEQ ID NO: 736 is the determined cDNA sequence for clone R0093:F06.
SEQ ID NO: 737 is the determined cDNA sequence for clone R0093:F08.
SEQ ID NO: 738 is the determined cDNA sequence for clone R0093:F09.
SEQ ID NO: 739 is the determined cDNA sequence for clone R0093:F10.
SEQ ID NO: 740 is the determined cDNA sequence for clone R0093:F12.
SEQ ID NO: 741 is the determined cDNA sequence for clone R0093:G01.
SEQ ID NO: 742 is the determined cDNA sequence for clone R0093:G03.
SEQ ID NO: 743 is the determined cDNA sequence for clone R0093:G04.
SEQ ID NO: 744 is the determined cDNA sequence for clone R0093:G06.
SEQ ID NO: 745 is the determined cDNA sequence for clone R0093:G07.
SEQ ID NO: 746 is the determined cDNA sequence for clone R0093:G08.
SEQ ID NO: 747 is the determined cDNA sequence for clone R0093:G09.
SEQ ID NO: 748 is the determined cDNA sequence for clone R0093:G10.
SEQ ID NO: 749 is the determined cDNA sequence for clone R0093:G11.
SEQ ID NO: 750 is the determined cDNA sequence for clone R0093:G12.
SEQ ID NO: 751 is the determined cDNA sequence for clone R0093:H02.
SEQ ID NO: 752 is the determined cDNA sequence for clone R0093:H03.
SEQ ID NO: 753 is the determined cDNA sequence for clone R0093:H04.
SEQ ID NO: 754 is the determined cDNA sequence for clone R0093:H05.
SEQ ID NO: 755 is the determined cDNA sequence for clone R0093:H07.
SEQ ID NO: 756 is the determined cDNA sequence for clone R0093:H08.
SEQ ID NO: 757 is the determined cDNA sequence for clone R0093:H09.
SEQ ID NO: 758 is the determined cDNA sequence for clone R0093:H10.
SEQ ID NO: 759 is the determined cDNA sequence for clone R0093:H11.
SEQ ID NO: 760 is the determined cDNA sequence for clone R0094:A03.
SEQ ID NO: 761 is the determined cDNA sequence for clone R0094:A05.
SEQ ID NO: 762 is the determined cDNA sequence for clone R0094:A06.
SEQ ID NO: 763 is the determined cDNA sequence for clone R0094:A07.
SEQ ID NO: 764 is the determined cDNA sequence for clone R0094:A09.
SEQ ID NO: 765 is the determined cDNA sequence for clone R0094:A10.
SEQ ID NO: 766 is the determined cDNA sequence for clone R0094:A12.
SEQ ID NO: 767 is the determined cDNA sequence for clone R0094:B03.
SEQ ID NO: 768 is the determined cDNA sequence for clone R0094:B06.
SEQ ID NO: 769 is the determined cDNA sequence for clone R0094:B08.
SEQ ID NO: 770 is the determined cDNA sequence for clone R0094:B11.
SEQ ID NO: 771 is the determined cDNA sequence for clone R0094:B12.
SEQ ID NO: 772 is the determined cDNA sequence for clone R0094:C01.
SEQ ID NO: 773 is the determined cDNA sequence for clone R0094:C02.
SEQ ID NO: 774 is the determined cDNA sequence for clone R0094:C03.
SEQ ID NO: 775 is the determined cDNA sequence for clone R0094:C05.
SEQ ID NO: 776 is the determined cDNA sequence for clone R0094:C06.
SEQ ID NO: 777 is the determined cDNA sequence for clone R0094:C08.
SEQ ID NO: 778 is the determined cDNA sequence for clone R0094:C09.
SEQ ID NO: 779 is the determined cDNA sequence for clone R0094:C10.
SEQ ID NO: 780 is the determined cDNA sequence for clone R0094:C11.
SEQ ID NO: 781 is the determined cDNA sequence for clone R0094:C12.
SEQ ID NO: 782 is the determined cDNA sequence for clone R0094:D01.
SEQ ID NO: 783 is the determined cDNA sequence for clone R0094:D02.
SEQ ID NO: 784 is the determined cDNA sequence for clone R0094:D03.
SEQ ID NO: 785 is the determined cDNA sequence for clone R0094:D04.
SEQ ID NO: 786 is the determined cDNA sequence for clone R0094:D05.
SEQ ID NO: 787 is the determined cDNA sequence for clone R0094:D07.
SEQ ID NO: 788 is the determined cDNA sequence for clone R0094:D08.
SEQ ID NO: 789 is the determined cDNA sequence for clone R0094:D09.
SEQ ID NO: 790 is the determined cDNA sequence for clone R0094:D10.
SEQ ID NO: 791 is the determined cDNA sequence for clone R0094:D12.
SEQ ID NO: 792 is the determined cDNA sequence for clone R0094:E01.
SEQ ID NO: 793 is the determined cDNA sequence for clone R0094:E02.
SEQ ID NO: 794 is the determined cDNA sequence for clone R0094:E03.
SEQ ID NO: 795 is the determined cDNA sequence for clone R0094:E05.
SEQ ID NO: 796 is the determined cDNA sequence for clone R0094:E06.
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TD 


NO- 


797 


Id 


the 

11 1C 


H pfprm i n pH 
ucici milieu. 


rDNA 


Qpniipnrp 
deu^ueiiee 


tfr*r rlrtnp 
iui eiuiic 


R0nQd-F07 

XVUU2/-T.XJV/ / . 


SEO 


TD 

XX-/ 


NO* 

IN v-/. 


798 


Id 


the 
me 


A pf prrn i n pH 
UCICI 11111 icu 


rDNA 


devj ueiiee 


fnr rlnnp 
iui enjiic 


T?0094*F08 


SEO 


TD 


NO* 


799 


i<; 


the 

Lilts 


H pterm i n pH 

UWLt/l 111111CU 


rDNA 


Qpmipnrp 


fnr rlnnp 
iui L/HJiie 


P0094F09 


SEO 


ID 


NO* 

1 t| v_/ > 


800 


is 


the 


H pterm i n pH 

UvlLl lllllll/li 


cDNA 

vX/lir\ 


Qpniipnrp 


fXr rlnnp 

Xv/X vlUllv 


R0094F10 


SEO 


TD 


NO* 


801 


id 


the 


H ptptm i n pH 

UCLC1 llllllCU 


rDNA 


Qpmipnrp 
oeu ueiiee 


fnr plnrtp 
xui eiuiie 


R0094F1 1 


SEO 


ID 


NO- 


802 




the 


(\ pt prm i n pH 

III/ L VI lillllVU 


cDNA 


Qeniienre 

OvUUvllvv 


fnr rlnnp 
x\ji eiv^iie 


R0094-F1 2 


SEO 


ID 


NO* 


803 


lo 


the 

LI 11/ 


H pterm i n eH 

Uwlvl llllilVsU 


cDNA 


Qpnnpnrp 


fnr rlnnp 
iui eiuiie 


R0094-F01 


SEO 


TD 


NO* 

in w. 


804 

Out 


Id 


the 

11 1C 


H pfprm mpH 

UCLClllllUCU 


rDNA 


Qpniipnrp 
oeL| ixeiiee 


fnr rlnnp 
iui eiuiic 


IvUU7*t .F V/J) . 


SEO 


ID 


NO* 


805 


id 


the 

Li IV 


deteimined 


cDNA 


qpmipnrp 


fnr rlnnp 
iui eiuiie 


R0094-F0S 


SEO 


TD 


NO* 
in \j . 


806 


id 


the 

LI 11/ 


(\ pterm ineH 

UC LCi lliiilCU 


cDNA 


Qpnnpnrp 
oclj ueiiee 


fnr rlnnp 
iui eiuiie 


R00Q4F06 


SEO 


TD 


NO* 

IN V_7. 


807 


id 


the 

LUC 


c\ pterm ineH 

Ut Lt/l llllllt'Ll 


rDNA 


qpmipnrp 
oci| ueiiee 


fnr rlnnp 
iui eiuiie 


R0094-F07 


SEO 


TD 


NO* 

IN w. 


808 


Id 


the 
ii i w 


HpferminpH 

Ld llllilvLl 


rDNA 


Qpmipnrp 
dcu^ ueiiee 


fnr rlnnp 
lkjl eiuiie 


R0094-F08 


SFO 


TD 


NO* 


809 


1 Q 
Ld 


the 

LUC 


H ptprm i n pH 

UC LCI llllIlCLl 


rDNA 


c pn 1 1 pn r* p 
ociJuciiec 


flnr r* 1 r\r\ p 

iui eiunc 


i\UU7T.r u*7. 


SFO 


TD 

IX-/ 


NO* 


810 


Ld 


the 

11 1C 


H pfprm i n pH 

IXC LCI 111111CU 


rDNA 


c pn i i pn c p 
ocu^ueiiee 


iui eiuiie 


ivuuyT.r iu. 


SFO 


TD 


NO* 

IN V-/. 


81 1 

Oil 


1 C 
Id 


the 

LUC 


H ptprm l n pH 
UCIC1 llllllCli 


rDNA 


c pn 1 1 pn r* p 

dCL[uciiec 


iui eiuiie 


R00Q4-F1 1 


SFO 


TD 


NO* 

IN W. 


81? 

OIL 


Ld 


the 

LUC 


H pfprm i n pH 
yJLV LCI llllllCU. 


rDNA 

UJL/1N r\ 


c pn 1 1 pn r* p 

dcu^ueiiee 


fIr\T plrtnp 

iui eiuxic 


R0nQ4-F1 9 
i\UU7*T.r i^. 


SFO 


TD 


NO* 

IN \J . 


81^ 


1 c 

Id 


the 

LUC 


H pfprm i n pn 

uc lci milieu 


rDNA 


c pn i i pn c p 
dCt[uciiec 


iui eiuiic 


T?onQ4-no? 


9FO 


TD 

Xl_7 


NO- 

IN \J . 


CIA 
o it 


1 c 
Ld 


the 

LUC 


UC LCI mnicu 


rDNA 


c pn 1 1 pn /~* p 

oet[Ueilee 


lor eiune 


XvVJU*7 i t . vJvJ . 


SFO 


TD 

11_7 


NO* 

IN 


81 S 


Id 


the 

LUC 


A pfprm ^ n 
uc lci mmcu 


rDNA 


c pn i i pn r* p 

deu^ueiiee 


iui eiuiie 


T?00Q4-nn4 


SFO 


TD 

11-/ 


NO* 


816 

O 1 U 


i C 
Id 


the 

LUC 


A pfprm i n pH 

uc lci mmcu 


rDNA 


c pn 1 1 pn r* p 

dcu^uciiec 


iui eiuiie 




SFO 


TD 

XXV 


NO* 


817 

O 1 / 


i C 

Id 


the 

LUC 


H pfprm i n pH 
ue lci milieu 


rDNA 


dcu^uenec 


fnr rlnnp 
iui eiuiie 


R00Q4*n07 


SFO 


TD 

XXV 


NO* 


818 

OlO 


Id 


the 

LUC 


A pfprm i n pH 
ue lci milieu 


rDNA 


cp/inpnrp 


fnr rlnnp 
iui eiunc 


R0094-G08 


SEO 


TD 

XXV 


NO* 

IN \J • 


819 

017 


Id 


the 

LUC 


H pfprm i n pH 
ueieimmeu 


cDNA 


Qpmipnrp 
oeu ueiiee 


fnr rlnnp 
iui eiuiie 


R0094G10 


SEO 


TD 


NO* 

IN W. 


820 


ic 

Id 


the 
nit/ 


H pfprm i rt pH 


rDNA 


Qpnnpnrp 
deu ueiiee 


fnr rlnnp 
iui eiuiie 


]? 0094*01 1 


SFO 


TD 

xxv 


NO* 


821 


Id 


the 

nit- 


H pfprm i n prl 
ueiei milieu 


rDNA 

L/l_ylN A 


dcu uciiec 


fnr rlnnp 
iui eiuiie 


]? 0094-019 




SEQ ID NO: 822 is the determined cDNA sequence for clone R0094:H01.
SEQ ID NO: 823 is the determined cDNA sequence for clone R0094:H03.
SEQ ID NO: 824 is the determined cDNA sequence for clone R0094:H04.
SEQ ID NO: 825 is the determined cDNA sequence for clone R0094:H05.
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SEO 


ID 


NO- 


826 


1^ 


the 
nxv 


dptprrnined 

VXV L VI llllllVVX 


cDNA 


cpnnenre 

O VVJ Ll V11V V 


frvr rlnne 

LKJL VlWllV 


XVVJ vj Zs *T . XX V U . 


SEO 


ID 


NO* 


827 


i<! 


the 


deterrm ned 

VXV L VI 11111 IvU 


cDNA 


O VVJ IX VI IV V 


for rlnne 

XVI V1VJ11V 


R0094*H08 

X VVJ VJ *T . X X \J O . 


SEQ 


ID 


NO: 


828 


is 


the 


determined 

VXV I VX 1 1 llllvvl 


cDNA 


cAfi ii en re 

ijvU Uvllvv 


for rlone 

1U1 VXVsXXV 


R0094*H09 


SEO 


ID 


NO' 


829 


is 


the 


determined 

VXV L VI xxxxxxvvx 


cDNA 


^ennenrp 

O VVJ IX VI IV V 


for rlnne 

XVI V 1 Wl 1 V 


R0094*H10 


SEO 


ID 


NO* 


830 


is 


the 


determined 

VXV L VI 11111 1 VVX 


cDNA 


^pnnpnrp 

O VVJ VX VI XV V 


for rlnne 

XVI V1U11V 


R0094-H1 1 

XV\/VJ^*T.XX 1 X. 


SEO 


ID 

XX~S 


NO* 

x^i \y . 


831 


io 


the 

11 XV 


d etprm i n pd 

VXVLVX XXX111VVX 


cDNA 


Qpnnpnre 

OVVJ^UVllVV 


fnr plnnp 
xvi vivJiiv 


R009S-A03 


SEO 


TD 


NO- 

x> vy . 


OJZ. 




the 

lllv 


d ptprm i n pd 

ViVLVl llllllVVX 


rDNA 


ovv^UvllVv 


fnr r*1r\np 
lUI VvlUllv 


R00QS- AOiS 


SEO 


ID 


NO- 

X^l v_/ . 


833 


i<i 


the 

11 XV 


d ptpim i n pd 

Uv LVX XXXXXXVVX 


cDNA 


Qpnnpnre 

OVVJLlVilVV 


fnr plnnp 

XUi VlVJllV 


R009S-A07 

xvVJvJ^ J .rWJ 1 . 


SEO 


ID 


NO- 

XI \_/ . 


834 


i<\ 

IO 


the 


d eterm i n ed 

VXV L VI XXXXXXVVX 


cDNA 


^pnnpnrp 

OVVJIXVIIVV 


fnr rlnnp 

XVJl V1WX1V 


R009S*F*01 

Xv\JvJj/-J .xJ VJ 1 . 


SEO 


ID 


NO' 


835 


is 


the 


determined 

Uv I VI 11 111 1 VVX 


cDNA 


O VVJ LX VI I V V 


for rlnne 

XV/X ViV^XXV 


R009S*R02 


SEO 


ID 


NO' 


836 


is 


the 


determined 

VXV I. VI 111111VVX 


cDNA 


^pmienrp 

O VVJ IXV11VV 


fnr rlnne 

XVJl V1VJI1V 


R009*vR03 


SEO 


TD 


NO* 


837 


io 


the 

LXXV 


d ptprm i n pd 

Uvlvl 1X1111VVX 


rDNA 


Qpnnptirp 

OVVJ U.V11VV 


fnr rlnnp 

L\JL VlVjllV 




SEO 


TD 


NO* 


838 


Id 


the 
mv 


d ptprm i n pd 

VXVLVX llllllVVX 


rDNA 


O VVJ LXV11VV 


fnr rlnnp 

XVJl VlVjllV 




SEO 


TD 


NO- 

x^i vy . 


839 


i^ 


the 


d ptprm i n pd 

VXV LV1 lllillVU 


cDNA 


Qpniienpp 

O VVJ U.V11VV 


fnr plnnp 

L\JL VlVjllV 




SEO 


ID 


NO* 

x^t vy . 


840 




the 

LXXV 


d ptprm in pd 

VXV L VI llllllVVX 


cDNA 


sipniipnpp 

O VVJ LX V11VV 


fnr rlnnp 

1U1 V1VJ11V 


XvVJvJZ' J .XJ 1 VJ. 


SEO 


ID 

XX-/ 


NO* 

X> \J , 


841 

1 




the 

IX1V 


d ptprm i n pd 

VXV L VI llllllVVX 


cDNA 


o Vvv| UVllV V 


fnr plnnp 

1VJ1 VlVJllV 


XvVJVJ^w" .J_> 1 I . 


SEO 


ID 

XX7 


NO- 


842 


i^ 


the 

LI IV 


d pterm i n pd 

VXV L VI llllllVVX 


cDNA 


o ViJ UVllV V 


fnr plnnp 

XVJl VlVJllV 


XvVJVJj? J .DlL. 


SFO 


TD 

XX-/ 


NO- 


843 


1 Q 


the 

L11V 


d ptprm i ti ed 

VXV LVl llllllVVX 


rDNA 


Qpniipnpp 

OVU UVllVV 


fnr r*1nnp 

LSJI VlVJllV 






TD 


NO 


844 


lO 


the 

lilt 


vxvlvl lllllivvl 


rDNA 


OVV^UCllVvV 


fr\r r* 1 
1U1 vlUllv 


xVVjU^»J . KsKJD . 




TD 


NO 


84S 


1 C 


the 

lllv 


H ptf a T*tt*i i n 
VIC Lvl lllllivvl 


rfiMA 
vl_ylN/\ 


ovL[Uvilvv 


1UI vlUIlv 




SFO 


TD 

xx_y 


NO* 


846 


1 Q 


the 

L11V 


d pterm i n pd 

VXV LVl llllllVVX 


rDNA 


cpnnpnpp 

O VU W VllVV 


fnr plnnp 

lUI V1U11V 


XVVJV/-7 J . VvVJ«J, 


SEO 


TD 


NO* 


847 




the 

LXXV 


d pterm i n pd 

VXVLVX llllllVVX 


cDNA 


cpniipnrp 

O VV^ UVllV V 


fnr plnnp 

lvJl VlVJllV 




SEO 


ID 


NO- 


848 

kJ~0 


\<i 
IO 


the 

LX1V 


d ptprm i n pd 

VXV LVl 11 1111 VVX 


cDNA 


Qpniipnre 

OVVJUVllVV 


fnr plnnp 

XVJl VlVJllV 


i\.\j\jy*j . v^vj / . 


SEO 

kJ J— / 


ID 


NO- 


849 


IO 


the 

LX1V 


detprmined 

Uvlvl llllllVVX 


cDNA 


QPflllPTlPP 
OVU UVllV V 


fnr rlnne 
xvji vivjiiv 


R009S*r08 


SEO 

kJi-/V^ 


TD 


NO* 

IN vy . 


8^0 


Io 


the 


d eterm i n pd 

UVlVl llllllVVX 


cDNA 


QpniiPTipp 

OVU UVllVV 


fnr flnnp 
1U1 VlvJllv 


R00QS-nO 

ISJJVjy J .V/ 1 \J. 


SEQ ID NO: 851 is the determined cDNA sequence for clone R0095:C12.
SEQ ID NO: 852 is the determined cDNA sequence for clone R0095:D01.
SEQ ID NO: 853 is the determined cDNA sequence for clone R0095:D03.
SEQ ID NO: 854 is the determined cDNA sequence for clone R0095:D04.



210121.471C11 



SEQ ID NO: 855 is the determined cDNA sequence for clone R0095:D06.
SEQ ID NO: 856 is the determined cDNA sequence for clone R0095:D07.
SEQ ID NO: 857 is the determined cDNA sequence for clone R0095:D08.
SEQ ID NO: 858 is the determined cDNA sequence for clone R0095:D09.
SEQ ID NO: 859 is the determined cDNA sequence for clone R0095:D11.
SEQ ID NO: 860 is the determined cDNA sequence for clone R0095:D12.
SEQ ID NO: 861 is the determined cDNA sequence for clone R0095:E01.
SEQ ID NO: 862 is the determined cDNA sequence for clone R0095:E02.
SEQ ID NO: 863 is the determined cDNA sequence for clone R0095:E04.
SEQ ID NO: 864 is the determined cDNA sequence for clone R0095:E05.
SEQ ID NO: 865 is the determined cDNA sequence for clone R0095:E06.
SEQ ID NO: 866 is the determined cDNA sequence for clone R0095:E07.
SEQ ID NO: 867 is the determined cDNA sequence for clone R0095:E08.
SEQ ID NO: 868 is the determined cDNA sequence for clone R0095:E11.
SEQ ID NO: 869 is the determined cDNA sequence for clone R0095:E12.
SEQ ID NO: 870 is the determined cDNA sequence for clone R0095:F01.
SEQ ID NO: 871 is the determined cDNA sequence for clone R0095:F03.
SEQ ID NO: 872 is the determined cDNA sequence for clone R0095:F06.
SEQ ID NO: 873 is the determined cDNA sequence for clone R0095:F10.
SEQ ID NO: 874 is the determined cDNA sequence for clone R0095:F11.
SEQ ID NO: 875 is the determined cDNA sequence for clone R0095:G02.
SEQ ID NO: 876 is the determined cDNA sequence for clone R0095:G03.
SEQ ID NO: 877 is the determined cDNA sequence for clone R0095:G04.
SEQ ID NO: 878 is the determined cDNA sequence for clone R0095:G08.
SEQ ID NO: 879 is the determined cDNA sequence for clone R0095:G09.
SEQ ID NO: 880 is the determined cDNA sequence for clone R0095:G10.
SEQ ID NO: 881 is the determined cDNA sequence for clone R0095:H01.
SEQ ID NO: 882 is the determined cDNA sequence for clone R0095:H02.
SEQ ID NO: 883 is the determined cDNA sequence for clone R0095:H04.
SEQ ID NO: 884 is the determined cDNA sequence for clone R0095:H06.
SEQ ID NO: 885 is the determined cDNA sequence for clone R0095:H07.
SEQ ID NO: 886 is the determined cDNA sequence for clone R0095:H09.
SEQ ID NO: 887 is the determined cDNA sequence for clone R0096:A02.
SEQ ID NO: 888 is the determined cDNA sequence for clone R0096:A08.
SEQ ID NO: 889 is the determined cDNA sequence for clone R0096:A09.
SEQ ID NO: 890 is the determined cDNA sequence for clone R0096:A10.
SEQ ID NO: 891 is the determined cDNA sequence for clone R0096:A11.
SEQ ID NO: 892 is the determined cDNA sequence for clone R0096:A12.
SEQ ID NO: 893 is the determined cDNA sequence for clone R0096:B02.
SEQ ID NO: 894 is the determined cDNA sequence for clone R0096:B03.
SEQ ID NO: 895 is the determined cDNA sequence for clone R0096:B04.
SEQ ID NO: 896 is the determined cDNA sequence for clone R0096:B05.
SEQ ID NO: 897 is the determined cDNA sequence for clone R0096:B06.
SEQ ID NO: 898 is the determined cDNA sequence for clone R0096:B07.
SEQ ID NO: 899 is the determined cDNA sequence for clone R0096:B08.
SEQ ID NO: 900 is the determined cDNA sequence for clone R0096:B09.
SEQ ID NO: 901 is the determined cDNA sequence for clone R0096:B10.
SEQ ID NO: 902 is the determined cDNA sequence for clone R0096:B11.
SEQ ID NO: 903 is the determined cDNA sequence for clone R0096:B12.
SEQ ID NO: 904 is the determined cDNA sequence for clone R0096:C01.
SEQ ID NO: 905 is the determined cDNA sequence for clone R0096:C03.
SEQ ID NO: 906 is the determined cDNA sequence for clone R0096:C04.
SEQ ID NO: 907 is the determined cDNA sequence for clone R0096:C05.
SEQ ID NO: 908 is the determined cDNA sequence for clone R0096:C06.
SEQ ID NO: 909 is the determined cDNA sequence for clone R0096:C07.
SEQ ID NO: 910 is the determined cDNA sequence for clone R0096:C08.
SEQ ID NO: 911 is the determined cDNA sequence for clone R0096:C09.
SEQ ID NO: 912 is the determined cDNA sequence for clone R0096:C10.
cDNA sequence for clone R0096:C11.
cDNA sequence for clone R0096:C12. 
cDNA sequence for clone R0096:D01. 
cDNA sequence for clone R0096:D02. 
cDNA sequence for clone R0096:D03. 
cDNA sequence for clone R0096:D04. 
cDNA sequence for clone R0096:D05. 
cDNA sequence for clone R0096:D08. 
cDNA sequence for clone R0096:D09. 
cDNA sequence for clone R0096:D10. 
cDNA sequence for clone R0096:D12. 
cDNA sequence for clone R0096:E01. 
cDNA sequence for clone R0096:E02. 
cDNA sequence for clone R0096:E03. 
cDNA sequence for clone R0096:E04. 
cDNA sequence for clone R0096:E05. 
cDNA sequence for clone R0096:E06. 
cDNA sequence for clone R0096:E08. 
cDNA sequence for clone R0096:E09. 
cDNA sequence for clone R0096:E10. 
cDNA sequence for clone R0096:E11.
cDNA sequence for clone R0096:E12. 
cDNA sequence for clone R0096:F01. 
cDNA sequence for clone R0096:F02. 
cDNA sequence for clone R0096:F03. 
cDNA sequence for clone R0096:F04. 
cDNA sequence for clone R0096:F05. 
cDNA sequence for clone R0096:F07. 
cDNA sequence for clone R0096:F10. 
cDNA sequence for clone R0096:F11.
cDNA sequence for clone R0096:G01. 
cDNA sequence for clone R0096:G03. 
cDNA sequence for clone R0096:G04. 
cDNA sequence for clone R0096:G05. 
cDNA sequence for clone R0096:G06. 
cDNA sequence for clone R0096:G07. 
cDNA sequence for clone R0096:G09. 
cDNA sequence for clone R0096:G10. 
cDNA sequence for clone R0096:G12. 
cDNA sequence for clone R0096:H01. 
cDNA sequence for clone R0096:H02. 
cDNA sequence for clone R0096:H03. 
cDNA sequence for clone R0096:H07. 
cDNA sequence for clone R0096:H08. 
cDNA sequence for clone R0097:A05. 
cDNA sequence for clone R0097:A06. 
cDNA sequence for clone R0097:A10. 
cDNA sequence for clone R0097:A11.
cDNA sequence for clone R0097:B01. 
cDNA sequence for clone R0097:B03. 
cDNA sequence for clone R0097:B04. 
cDNA sequence for clone R0097:B05. 
cDNA sequence for clone R0097:B06. 
cDNA sequence for clone R0097:B07. 
cDNA sequence for clone R0097:B11.
cDNA sequence for clone R0097:C01. 
cDNA sequence for clone R0097:C02. 
cDNA sequence for clone R0097:C03. 
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for plonp 
hjx t/iuixc 


000S-F01 


SFO 


TD 

Lis 


NO* 

1>I Vv. 


1018 


ic the 

lO Lilt 


f\ ptprm i n pH 

VlGlCllIilllCVl 


rDNA 


cpniipfipp 

oCiJ UCIlt/t/ 


1UX L/1UI1C 


xxU U V O . 13U t T . 


SFO 


TD 


NO* 


1019 


ic the 

lO lilt/ 


d eterm i n ed 

LlCLt/ilillllt/tl 


rDNA 


oCLJUCllt/C 


for plnnp 

LKJL L/1UIXC 


R00QR-F0S 


SEO 


TD 

XXV 


NO* 


1020 


iq the 

lO Lilt/ 


determined 

tlt/lt/llllllICtl 


rDNA 


qennpnrp 

Ot/tJ IXt/llt/t/ 


for rlonp 


ROOQR-FOfi 


SFO 

Ji. /VV 


TD 

XXV 


NO* 


1021 


ic the 

lO lilt/ 


d eterm i n eH 
tit/it/i iimit'ti 


rDNA 

t/XV I N i\ 


Qpniipnpp 


for rlonp 

XtJl l/XUXXC 


R00QR-F07 

xVV/UvO .Lj*\J 1 . 


SEO 


ID 

XXX 


NO- 


1022 


iq the 

ID Lilt/ 


d eterm i n ed 
tit>it/i iimit/ti 


cDNA 

t/XVl^l/T. 


qennenre 

OvU Ut/llt't/ 


for rlone 

1U1 t/lt/llt' 


R00QRF1 1 

J\UU70.Iw X X . 


SEO 


TD 

XXV 


NO* 


1023 


iq the 

lO Lilt/ 


d eterm i n ed 
tit/ it/ii in lit/ ti 


cDNA 

tfXVl ti A 


qennenrp 

Ot>tJUdlt/C 


for rlonp 

1 Ul t'XWXXC 


R00QRF04 

XVV/V/vO .XT V/*t. 


SEO 


ID 

X.IV 


NO* 


1024 


iq the 

ID lilt/ 


determined 
ti t/ 1 t/i 1 1 1 int/ti 


rDNA 


qennenre 


for rlone 

LKJL t/Xv/llt/ 


R00QR-F0S 

XXV/V/ vO .J/ V/ J . 


SEQ ID NO: 1025 is the determined cDNA sequence for clone R0098:F06.
SEQ ID NO: 1026 is the determined cDNA sequence for clone R0098:F07.
SEQ ID NO: 1027 is the determined cDNA sequence for clone R0098:F08.
SEQ ID NO: 1028 is the determined cDNA sequence for clone R0098:F09.
SEQ ID NO: 1029 is the determined cDNA sequence for clone R0098:F10.
SEQ ID NO: 1030 is the determined cDNA sequence for clone R0098:F11.
SEQ ID NO: 1031 is the determined cDNA sequence for clone R0098:F12.
SEQ ID NO: 1032 is the determined cDNA sequence for clone R0098:G02.
SEQ ID NO: 1033 is the determined cDNA sequence for clone R0098:G03.
SEQ ID NO: 1034 is the determined cDNA sequence for clone R0098:G05.
SEQ ID NO: 1035 is the determined cDNA sequence for clone R0098:G06.
SEQ ID NO: 1036 is the determined cDNA sequence for clone R0098:G07.
SEQ ID NO: 1037 is the determined cDNA sequence for clone R0098:G08.
SEQ ID NO: 1038 is the determined cDNA sequence for clone R0098:G09.
SEQ ID NO: 1039 is the determined cDNA sequence for clone R0098:G10.
SEQ ID NO: 1040 is the determined cDNA sequence for clone R0098:G11.
SEQ ID NO: 1041 is the determined cDNA sequence for clone R0098:G12.
SEQ ID NO: 1042 is the determined cDNA sequence for clone R0098:H02.
SEQ ID NO: 1043 is the determined cDNA sequence for clone R0098:H03.
SEQ ID NO: 1044 is the determined cDNA sequence for clone R0098:H04.


H 






TD NO* 


1 04 S 


io LUC 


U-CLCilIlIIlvu 




oCC[UCIlvC 


1UT vlUUC 








SEQ ID NO: 1046 is the determined cDNA sequence for clone R0098:H07.
SEQ ID NO: 1047 is the determined cDNA sequence for clone R0098:H08.
SEQ ID NO: 1048 is the determined cDNA sequence for clone R0098:H11.



SEQ ID NO: 1049 is the determined cDNA sequence for clone C878P
which shows sequence similarity to Homo sapiens cDNA FLJ10884 fis, clone
NT2RP4001950 and Homo sapiens cDNA FLJ11111 fis, clone PLACE1005923.

SEQ ID NO: 1050 is the determined cDNA sequence for clone C882P which shows
sequence similarity to Homo sapiens cDNA FLJ20116 fis, clone COL05655 and Homo
sapiens cDNA FLJ20740 fis, clone HEP07118.

SEQ ID NO: 1051 is the determined cDNA sequence for clone C883P which shows 
sequence similarity to human homeobox protein Cdx2 mRNA. 
SEQ ID NO: 1052 is the determined cDNA sequence for clone C884P which shows
sequence similarity to human TM4SF3 (aka CO-029).

SEQ ID NO: 1053 is the determined cDNA sequence for clone C886P which shows
sequence similarity to human secretory protein (P1.B) mRNA and Homo sapiens trefoil
factor 3 (intestinal) (TFF3) mRNA.

SEQ ID NO: 1054 is the determined cDNA sequence for clone C892P which shows 
sequence similarity to human galectin-4 mRNA. 

SEQ ID NO: 1055 is the determined cDNA sequence for clone C900P which shows
sequence similarity to Homo sapiens mucin 11 (MUC11) mRNA.

SEQ ID NO: 1056 is the determined cDNA sequence for clone C902P which shows
sequence similarity to Homo sapiens calcium-dependent chloride channel-1 (hCLCA1)
mRNA.

SEQ ID NO: 1057 is the determined cDNA sequence for clone C903P which shows
sequence similarity to Homo sapiens transmembrane mucin 12 (MUC12) mRNA.

SEQ ID NO: 1058 is the determined cDNA sequence for clone C899P which shows
sequence similarity to Homo sapiens intestinal mucin (MUC2) mRNA.

SEQ ID NO: 1059 is the predicted amino acid sequence for the clone of SEQ ID 
NO: 1049. 

SEQ ID NO: 1060 is the predicted amino acid sequence for the clone of SEQ ID 
NO: 1050. 

SEQ ID NO: 1061 is the predicted amino acid sequence for the clone of SEQ ID 
NO:1051. 

SEQ ID NO: 1062 is the predicted amino acid sequence for the clone of SEQ ID 
NO:1052. 

SEQ ID NO: 1063 is the predicted amino acid sequence for the clone of SEQ ID 
NO:1053. 

SEQ ID NO: 1064 is the predicted amino acid sequence for the clone of SEQ ID 
NO:1054. 
SEQ ID NO: 1065 is the predicted amino acid sequence for the clone of SEQ ID
NO: 1055.

SEQ ID NO: 1066 is the predicted amino acid sequence for the clone of SEQ ID
NO: 1056.

SEQ ID NO: 1067 is the predicted amino acid sequence for the clone of SEQ ID
NO: 1057.

SEQ ID NO: 1068 is the predicted amino acid sequence for the clone of SEQ ID
NO: 1058.

SEQ ID NO: 1069 is the full length nucleotide sequence for clone CS1-152 (C880P,
C887P).

SEQ ID NO: 1070 is the predicted amino acid sequence for the clone of SEQ ID 
NO: 1069. 

SEQ ID NO: 1071 is the cDNA sequence for human colon specific gene (geneseq
X03195) identified from a computer search of the public geneseq database and which
shows similarity to clone C880P.

SEQ ID NO: 1072 is the cDNA sequence for human protein comprising secretory
signal nucleotide sequence 3 (geneseq V29035) identified from a computer search of the
public geneseq database and which shows similarity to clone C880P.

SEQ ID NO: 1073 is the cDNA sequence for open reading frame human protein
comprising secretory signal 3 (geneseq V29036) identified from a computer search of the
public geneseq database and which shows similarity to clone C880P.

SEQ ID NO: 1074 is the cDNA sequence for human colon specific protein cDNA
(geneseq T51784) identified from a computer search of the public geneseq database and
which shows similarity to clone C880P.

SEQ ID NO: 1075 is the cDNA sequence for human Reg 1-gamma protein (geneseq
V29156) identified from a computer search of the public geneseq database and which
shows similarity to clone C880P.

SEQ ID NO: 1076 is the cDNA sequence for human intestinal peptide-associated
transporter HPT-1 mRNA, complete cds and Homo sapiens mRNA for LI-cadherin
(geneseq X18166) identified from a computer search of the public geneseq database and
which shows similarity to clone C888P.

SEQ ID NO: 1077 is the amino acid sequence of geneseq record W12691 which
shows sequence similarity to clone C880P.

SEQ ID NO: 1078 is the amino acid sequence of geneseq record W37866 which
shows sequence similarity to clone C880P.

SEQ ID NO: 1079 is the amino acid sequence of geneseq record W37929 which
shows sequence similarity to clone C880P.

SEQ ID NO: 1080 is the amino acid sequence of geneseq record W84274 which
shows sequence similarity to clone C880P.

SEQ ID NO: 1081 is the amino acid sequence of geneseq record W740898 which
shows sequence similarity to clone C888P.

SEQ ID NO: 1082 is the determined cDNA sequence for clone 27540.

SEQ ID NO: 1083 is the predicted amino acid sequence of clone 27540 (SEQ ID
NO: 1082).

DETAILED DESCRIPTION OF THE INVENTION 

As noted above, the present invention is generally directed to compositions
and methods for the therapy and diagnosis of cancer, such as colon cancer. The
compositions described herein may include colon tumor polypeptides, polynucleotides
encoding such polypeptides, binding agents such as antibodies, antigen presenting cells
(APCs) and/or immune system cells (e.g., T cells). Polypeptides of the present invention
generally comprise at least a portion (such as an immunogenic portion) of a colon tumor
protein or a variant thereof. A "colon tumor protein" is a protein that is expressed in colon
tumor cells at a level that is at least two fold, and preferably at least five fold, greater than
the level of expression in a normal tissue, as determined using a representative assay
provided herein. Certain colon tumor proteins are tumor proteins that react detectably
(within an immunoassay, such as an ELISA or Western blot) with antisera of a patient
afflicted with colon cancer. Polynucleotides of the subject invention generally comprise a
DNA or RNA sequence that encodes all or a portion of such a polypeptide, or that is
complementary to such a sequence. Antibodies are generally immune system proteins, or
antigen-binding fragments thereof, that are capable of binding to a polypeptide as described
above. Antigen presenting cells include dendritic cells, macrophages, monocytes,
fibroblasts and B-cells that express a polypeptide as described above. T cells that may be
employed within such compositions are generally T cells that are specific for a polypeptide
as described above.

The present invention is based on the discovery of human colon tumor
proteins. Sequences of polynucleotides encoding specific tumor proteins are provided in
SEQ ID NO: 1-121, 123-197, 205-630, 632-684, 686, 690-691, and 694-1081.

Colon Tumor Protein Polynucleotides 

Any polynucleotide that encodes a colon tumor protein or a portion or other 
variant thereof as described herein is encompassed by the present invention. Preferred 
polynucleotides comprise at least 15 consecutive nucleotides, preferably at least 30 
consecutive nucleotides and more preferably at least 45 consecutive nucleotides, that 
encode a portion of a colon tumor protein. More preferably, a polynucleotide encodes an 
immunogenic portion of a colon tumor protein. Polynucleotides complementary to any 
such sequences are also encompassed by the present invention. Polynucleotides may be 
single-stranded (coding or antisense) or double-stranded, and may be DNA (genomic, 
cDNA or synthetic) or RNA molecules. RNA molecules include HnRNA molecules, 
which contain introns and correspond to a DNA molecule in a one-to-one manner, and 
mRNA molecules, which do not contain introns. Additional coding or non-coding 
sequences may, but need not, be present within a polynucleotide of the present invention, 
and a polynucleotide may, but need not, be linked to other molecules and/or support 
materials. 

Polynucleotides may comprise a native sequence (i.e., an endogenous 
sequence that encodes a colon tumor protein or a portion thereof) or may comprise a variant 
of such a sequence. Polynucleotide variants may contain one or more substitutions,
additions, deletions and/or insertions such that the immunogenicity of the encoded
polypeptide is not diminished, relative to a native tumor protein. The effect on the
immunogenicity of the encoded polypeptide may generally be assessed as described herein.
Variants preferably exhibit at least about 70% identity, more preferably at least about 80%
identity and most preferably at least about 90% identity to a polynucleotide sequence that
encodes a native colon tumor protein or a portion thereof.

Two polynucleotide or polypeptide sequences are said to be "identical" if 
the sequence of nucleotides or amino acids in the two sequences is the same when aligned 
for maximum correspondence as described below. Comparisons between two sequences 
are typically performed by comparing the sequences over a comparison window to identify
and compare local regions of sequence similarity. A "comparison window," as used herein,
refers to a segment of at least about 20 contiguous positions, usually 30 to about 75, in
which a sequence may be compared to a reference sequence of the same number of
contiguous positions after the two sequences are optimally aligned.

Optimal alignment of sequences for comparison may be conducted using the
Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc.,
Madison, WI), using default parameters. This program embodies several alignment
schemes described in the following references: Dayhoff, M.O. (1978) A model of
evolutionary change in proteins - Matrices for detecting distant relationships. In Dayhoff,
M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical Research
Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) Unified
Approach to Alignment and Phylogenies pp. 626-645 Methods in Enzymology vol. 183,
Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) CABIOS
5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson, E.D. (1971)
Comb. Theor. 11:105; Saitou, N. and Nei, M. (1987) Mol. Biol. Evol. 4:406-425; Sneath,
P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and Practice of
Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and Lipman, D.J.
(1983) Proc. Natl. Acad. Sci. USA 80:726-730.

Preferably, the "percentage of sequence identity" is determined by
comparing two optimally aligned sequences over a window of comparison of at least 20 
positions, wherein the portion of the polynucleotide or polypeptide sequence in the 
comparison window may comprise additions or deletions (i.e. gaps) of 20 percent or less, 
usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference sequence (which 
does not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by determining the number of positions at which the identical
nucleic acid base or amino acid residue occurs in both sequences to yield the number of
matched positions, dividing the number of matched positions by the total number of
positions in the reference sequence (i.e., the window size) and multiplying the result by
100 to yield the percentage of sequence identity.
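
By way of illustration only, the calculation described above may be sketched in Python
as follows. The sketch assumes that the two sequences have already been optimally
aligned by a separate program, that deletions in the compared sequence are written as
"-" characters, and that the reference sequence contains no gaps. The function name
percent_identity and its parameter names are hypothetical and are not part of the
disclosure.

```python
def percent_identity(aligned_query, aligned_reference, gap="-",
                     min_window=20, max_gap_fraction=0.20):
    """Return the percent identity of a pre-aligned query against a reference.

    Assumptions (illustrative only): gaps in the query are written with the
    `gap` character, and the reference sequence has no gaps.
    """
    if len(aligned_query) != len(aligned_reference):
        raise ValueError("aligned sequences must have the same length")
    if gap in aligned_reference:
        raise ValueError("the reference sequence must not contain gaps")

    window = len(aligned_reference)  # total positions in the reference
    if window < min_window:
        raise ValueError(f"comparison window must cover at least {min_window} positions")

    # Gaps may make up 20 percent or less of the comparison window.
    if aligned_query.count(gap) / window > max_gap_fraction:
        raise ValueError("too many gaps in the comparison window")

    # Count positions at which the identical base or residue occurs in both.
    matched = sum(1 for q, r in zip(aligned_query, aligned_reference) if q == r)
    return 100.0 * matched / window


reference = "ACGTACGTACGTACGTACGT"
print(percent_identity("ACGTACGTACGTACGTACGA", reference))  # one mismatch in 20
print(percent_identity("ACGTACGTAC--ACGTACGT", reference))  # two gap positions in 20
```

Gap positions count as non-matching because the divisor is the window size of the
reference, consistent with the definition given above.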

Variants may also, or alternatively, be substantially homologous to a native 
gene, or a portion or complement thereof. Such polynucleotide variants are capable of 
hybridizing under moderately stringent conditions to a naturally occurring DNA sequence 
encoding a native colon tumor protein (or a complementary sequence). Suitable 
moderately stringent conditions include prewashing in a solution of 5X SSC, 0.5% SDS,
1.0 mM EDTA (pH 8.0); hybridizing at 50°C-65°C, 5X SSC, overnight; followed by
washing twice at 65°C for 20 minutes with each of 2X, 0.5X and 0.2X SSC containing 
0.1% SDS. 

It will be appreciated by those of ordinary skill in the art that, as a result of 
the degeneracy of the genetic code, there are many nucleotide sequences that encode a 
polypeptide as described herein. Some of these polynucleotides bear minimal homology to 
the nucleotide sequence of any native gene. Nonetheless, polynucleotides that vary due to 
differences in codon usage are specifically contemplated by the present invention. Further, 
alleles of the genes comprising the polynucleotide sequences provided herein are within the 
scope of the present invention. Alleles are endogenous genes that are altered as a result of 
one or more mutations, such as deletions, additions and/or substitutions of nucleotides. 
The resulting mRNA and protein may, but need not, have an altered structure or function.
Alleles may be identified using standard techniques (such as hybridization, amplification
and/or database sequence comparison).
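
The degeneracy of the genetic code may be illustrated with a short Python sketch that
counts how many distinct nucleotide sequences encode a given peptide. The sketch
assumes the standard genetic code; the function names and the example peptides are
hypothetical and are given for illustration only.

```python
from itertools import product
from math import prod

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(residue):
    """Return all codons that encode a one-letter amino acid residue."""
    codons = sorted(c for c, aa in CODON_TABLE.items() if aa == residue)
    if not codons:
        raise ValueError(f"unknown residue: {residue!r}")
    return codons


def count_coding_sequences(peptide):
    """Number of distinct nucleotide sequences that encode the peptide."""
    return prod(len(synonymous_codons(residue)) for residue in peptide)


print(synonymous_codons("M"))         # methionine has a single codon
print(count_coding_sequences("LSR"))  # three residues with six codons each
```

Even a three-residue peptide can be encoded by hundreds of different sequences, which
is why sequences differing only in codon usage may share little nucleotide homology.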

Polynucleotides may be prepared using any of a variety of techniques. For 
example, a polynucleotide may be identified, as described in more detail below, by 
screening a microarray of cDNAs for tumor-associated expression (i.e., expression that is at 
least two fold greater in a colon tumor than in normal tissue, as determined using a 
representative assay provided herein). Such screens may be performed using a Synteni 
microarray (Palo Alto, CA) according to the manufacturer's instructions (and essentially as 
described by Schena et al., Proc. Natl. Acad. Sci. USA 93:10614-10619, 1996 and Heller et
al., Proc. Natl. Acad. Sci. USA 94:2150-2155, 1997). Alternatively, polynucleotides may be
amplified from cDNA prepared from cells expressing the proteins described herein, such as
colon tumor cells. Such polynucleotides may be amplified via polymerase chain reaction
(PCR). For this approach, sequence-specific primers may be designed based on the 
sequences provided herein, and may be purchased or synthesized. 
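
The two-fold expression criterion used in such a screen may be sketched in Python as
follows. This is a minimal illustration only: the clone names and signal values are
invented, and the handling of zero or negative normal-tissue signal is an assumption
rather than part of any manufacturer's protocol.

```python
def tumor_associated(expression, min_fold=2.0):
    """Select clones whose tumor signal is at least `min_fold` times normal.

    `expression` maps a clone name to a (tumor_signal, normal_signal) pair.
    Returns a dict of clone name -> fold over-expression.
    """
    selected = {}
    for clone, (tumor, normal) in expression.items():
        if normal <= 0:
            # Ratio undefined; skipped here as a simplifying assumption.
            continue
        fold = tumor / normal
        if fold >= min_fold:
            selected[clone] = fold
    return selected


# Hypothetical microarray signals (arbitrary units).
signals = {
    "clone_A": (900.0, 150.0),
    "clone_B": (300.0, 200.0),
    "clone_C": (500.0, 100.0),
}
print(tumor_associated(signals))                # two fold threshold
print(tumor_associated(signals, min_fold=5.5))  # a stricter threshold
```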

An amplified portion may be used to isolate a full length gene from a 
suitable library (e.g., a colon tumor cDNA library) using well known techniques. Within 
such techniques, a library (cDNA or genomic) is screened using one or more 
polynucleotide probes or primers suitable for amplification. Preferably, a library is
size-selected to include larger molecules. Random primed libraries may also be preferred for
identifying 5' and upstream regions of genes. Genomic libraries are preferred for obtaining
introns and extending 5' sequences. 

For hybridization techniques, a partial sequence may be labeled (e.g., by 
nick-translation or end-labeling with 32P) using well known techniques. A bacterial or
bacteriophage library is then screened by hybridizing filters containing denatured bacterial
colonies (or lawns containing phage plaques) with the labeled probe (see Sambrook et al.,
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring
Harbor, NY, 1989). Hybridizing colonies or plaques are selected and expanded, and the
DNA is isolated for further analysis. cDNA clones may be analyzed to determine the
amount of additional sequence by, for example, PCR using a primer from the partial
sequence and a primer from the vector. Restriction maps and partial sequences may be
generated to identify one or more overlapping clones. The complete sequence may then be 
determined using standard techniques, which may involve generating a series of deletion 
clones. The resulting overlapping sequences are then assembled into a single contiguous 
sequence. A full length cDNA molecule can be generated by ligating suitable fragments, 
using well known techniques. 
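
The assembly of overlapping sequences into a single contiguous sequence may be
illustrated with the following Python sketch. It is a deliberately simplified
assumption: the two reads are taken to be error-free and in the same orientation, and
the function name merge_overlapping and the example reads are hypothetical. Practical
assembly software additionally tolerates sequencing errors and reverse-complement
reads.

```python
def merge_overlapping(left, right, min_overlap=3):
    """Join two reads where a suffix of `left` equals a prefix of `right`.

    Returns the merged sequence using the longest exact overlap, or None
    when no overlap of at least `min_overlap` bases exists.
    """
    longest = min(len(left), len(right))
    for size in range(longest, min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return left + right[size:]
    return None


# Hypothetical overlapping partial sequences sharing the bases CGGA.
print(merge_overlapping("ACGTACGGA", "CGGATTTC"))
print(merge_overlapping("AAAA", "CCCC"))  # no overlap
```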

Alternatively, there are numerous amplification techniques for obtaining a 
full length coding sequence from a partial cDNA sequence. Within such techniques, 
amplification is generally performed via PCR. Any of a variety of commercially available 
kits may be used to perform the amplification step. Primers may be designed using, for 
example, software well known in the art. Primers are preferably 22-30 nucleotides in 
length, have a GC content of at least 50% and anneal to the target sequence at temperatures 
of about 68°C to 72°C. The amplified region may be sequenced as described above, and 
overlapping sequences assembled into a contiguous sequence. 
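
The length and GC-content preferences stated above may be checked with a short Python
sketch such as the following. The function names are hypothetical, and the annealing
temperature is intentionally not computed, since it depends on the chosen model and
reaction conditions.

```python
def gc_fraction(primer):
    """Fraction of G and C bases in a primer (case-insensitive)."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def meets_primer_preferences(primer, min_len=22, max_len=30, min_gc=0.50):
    """Check the preferred length (22-30 nt) and GC content (at least 50%)."""
    if not primer or set(primer.upper()) - set("ACGT"):
        return False  # empty, or contains characters other than A, C, G, T
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


# Hypothetical candidate primers.
for candidate in ("GCGCATGCATGCGCATGCATGC", "AT" * 11, "GC" * 5):
    print(candidate, meets_primer_preferences(candidate))
```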

One such amplification technique is inverse PCR (see Triglia et al., Nucl.
Acids Res. 16:8186, 1988), which uses restriction enzymes to generate a fragment in the
known region of the gene. The fragment is then circularized by intramolecular ligation and 
used as a template for PCR with divergent primers derived from the known region. Within 
an alternative approach, sequences adjacent to a partial sequence may be retrieved by 
amplification with a primer to a linker sequence and a primer specific to a known region. 
The amplified sequences are typically subjected to a second round of amplification with the 
same linker primer and a second primer specific to the known region. A variation on this 
procedure, which employs two primers that initiate extension in opposite directions from 
the known sequence, is described in WO 96/38591. Another such technique is known as 
"rapid amplification of cDNA ends" or RACE. This technique involves the use of an 
internal primer and an external primer, which hybridizes to a polyA region or vector 
sequence, to identify sequences that are 5' and 3' of a known sequence. Additional
techniques include capture PCR (Lagerstrom et al., PCR Methods Applic. 1:111-19, 1991)
and walking PCR (Parker et al., Nucl. Acids Res. 19:3055-60, 1991). Other methods
employing amplification may also be employed to obtain a full length cDNA sequence.

In certain instances, it is possible to obtain a full length cDNA sequence by 
analysis of sequences provided in an expressed sequence tag (EST) database, such as that 
available from GenBank. Searches for overlapping ESTs may generally be performed
using well known programs (e.g., NCBI BLAST searches), and such ESTs may be used to 
generate a contiguous full length sequence. 

Certain nucleic acid sequences of cDNA molecules encoding portions of
colon tumor proteins are provided in SEQ ID NO: 1-121, 123-197, 205-630, 632-684, 686,
690-691, and 694-1081. These polynucleotides were isolated from colon tumor cDNA
libraries using conventional and/or PCR-based subtraction techniques, as described below.

Polynucleotide variants may generally be prepared by any method known in
the art, including chemical synthesis by, for example, solid phase phosphoramidite
chemical synthesis. Modifications in a polynucleotide sequence may also be introduced
using standard mutagenesis techniques, such as oligonucleotide-directed site-specific
mutagenesis (see Adelman et al., DNA 2:183, 1983). Alternatively, RNA molecules may
be generated by in vitro or in vivo transcription of DNA sequences encoding a colon tumor
protein, or portion thereof, provided that the DNA is incorporated into a vector with a
suitable RNA polymerase promoter (such as T7 or SP6). Certain portions may be used to
prepare an encoded polypeptide, as described herein. In addition, or alternatively, a portion
may be administered to a patient such that the encoded polypeptide is generated in vivo
(e.g., by transfecting antigen-presenting cells, such as dendritic cells, with a cDNA
construct encoding a colon tumor polypeptide, and administering the transfected cells to the
patient).

A portion of a sequence complementary to a coding sequence (i.e., an
antisense polynucleotide) may also be used as a probe or to modulate gene expression.
cDNA constructs that can be transcribed into antisense RNA may also be introduced into
cells of tissues to facilitate the production of antisense RNA. An antisense polynucleotide
may be used, as described herein, to inhibit expression of a tumor protein. Antisense
technology can be used to control gene expression through triple-helix formation, which
compromises the ability of the double helix to open sufficiently for the binding of
polymerases, transcription factors or regulatory molecules (see Gee et al., In Huber and
Carr, Molecular and Immunologic Approaches, Futura Publishing Co. (Mt. Kisco, NY;
1994)). Alternatively, an antisense molecule may be designed to hybridize with a control
region of a gene (e.g., promoter, enhancer or transcription initiation site), and block
transcription of the gene; or to block translation by inhibiting binding of a transcript to
ribosomes.
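
The relationship between a coding sequence and its antisense complement may be
illustrated with the following Python sketch. The function names and example sequences
are hypothetical, and the sketch assumes sequences written 5' to 3' using only the
unambiguous bases.

```python
_DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")
_RNA_COMPLEMENT = str.maketrans("ACGUacgu", "UGCAugca")


def reverse_complement(dna):
    """Antisense DNA strand, written 5' to 3', for a coding-strand sequence."""
    return dna.translate(_DNA_COMPLEMENT)[::-1]


def antisense_rna(mrna):
    """Antisense RNA, written 5' to 3', complementary to an mRNA segment."""
    return mrna.translate(_RNA_COMPLEMENT)[::-1]


print(reverse_complement("ATGGCC"))
print(antisense_rna("AUGGCC"))
```

Applying reverse_complement twice returns the original sequence, which offers a simple
consistency check on any antisense design.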

A portion of a coding sequence, or of a complementary sequence, may also
be designed as a probe or primer to detect gene expression. Probes may be labeled with a
variety of reporter groups, such as radionuclides and enzymes, and are preferably at least 
10 nucleotides in length, more preferably at least 20 nucleotides in length and still more 
preferably at least 30 nucleotides in length. Primers, as noted above, are preferably 22-30 
nucleotides in length. 

Any polynucleotide may be further modified to increase stability in vivo.
Possible modifications include, but are not limited to, the addition of flanking sequences at
the 5' and/or 3' ends; the use of phosphorothioate or 2' O-methyl rather than
phosphodiester linkages in the backbone; and/or the inclusion of nontraditional bases
such as inosine, queuosine and wybutosine, as well as acetyl-, methyl-, thio- and other
modified forms of adenine, cytidine, guanine, thymine and uridine.

Nucleotide sequences as described herein may be joined to a variety of other 
nucleotide sequences using established recombinant DNA techniques. For example, a 
polynucleotide may be cloned into any of a variety of cloning vectors, including plasmids, 
phagemids, lambda phage derivatives and cosmids. Vectors of particular interest include
expression vectors, replication vectors, probe generation vectors and sequencing vectors.
In general, a vector will contain an origin of replication functional in at least one organism,
convenient restriction endonuclease sites and one or more selectable markers. Other
elements will depend upon the desired use, and will be apparent to those of ordinary skill in
the art.

Within certain embodiments, polynucleotides may be formulated so as to
permit entry into a cell of a mammal, and expression therein. Such formulations are 
particularly useful for therapeutic purposes, as described below. Those of ordinary skill in 
the art will appreciate that there are many ways to achieve expression of a polynucleotide in 
a target cell, and any suitable method may be employed. For example, a polynucleotide 
may be incorporated into a viral vector such as, but not limited to, adenovirus,
adeno-associated virus, retrovirus, or vaccinia or other pox virus (e.g., avian pox virus).
Techniques for incorporating DNA into such vectors are well known to those of ordinary 
skill in the art. A retroviral vector may additionally transfer or incorporate a gene for a 
selectable marker (to aid in the identification or selection of transduced cells) and/or a 
targeting moiety, such as a gene that encodes a ligand for a receptor on a specific target 
cell, to render the vector target specific. Targeting may also be accomplished using an 
antibody, by methods known to those of ordinary skill in the art. 

Other formulations for therapeutic purposes include colloidal dispersion 
systems, such as macromolecule complexes, nanocapsules, microspheres, beads, and
lipid-based systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes.
A preferred colloidal system for use as a delivery vehicle in vitro and in vivo is a liposome 
(i.e., an artificial membrane vesicle). The preparation and use of such systems is well 
known in the art. 

Colon Tumor Polypeptides 

Within the context of the present invention, polypeptides may comprise at 
least an immunogenic portion of a colon tumor protein or a variant thereof, as described 
herein. As noted above, a "colon tumor protein" is a protein that is expressed by colon 
tumor cells. Proteins that are colon tumor proteins also react detectably within an 
immunoassay (such as an ELISA) with antisera from a patient with colon cancer. 
Polypeptides as described herein may be of any length. Additional sequences derived from 
the native protein and/or heterologous sequences may be present, and such sequences may 
(but need not) possess further immunogenic or antigenic properties.

An "immunogenic portion," as used herein, is a portion of a protein that is
recognized (i.e., specifically bound) by a B-cell and/or T-cell surface antigen receptor.
Such immunogenic portions generally comprise at least 5 amino acid residues, more
preferably at least 10, and still more preferably at least 20 amino acid residues of a colon
tumor protein or a variant thereof. Certain preferred immunogenic portions include
peptides in which an N-terminal leader sequence and/or transmembrane domain have been
deleted. Other preferred immunogenic portions may contain a small N- and/or C-terminal
deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids), relative to the mature
protein.

Immunogenic portions may generally be identified using well known
techniques, such as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247
(Raven Press, 1993) and references cited therein. Such techniques include screening
polypeptides for the ability to react with antigen-specific antibodies, antisera and/or T-cell
lines or clones. As used herein, antisera and antibodies are "antigen-specific" if they
specifically bind to an antigen (i.e., they react with the protein in an ELISA or other
immunoassay, and do not react detectably with unrelated proteins). Such antisera and
antibodies may be prepared as described herein, and using well known techniques. An
immunogenic portion of a native colon tumor protein is a portion that reacts with such
antisera and/or T-cells at a level that is not substantially less than the reactivity of the full
length polypeptide (e.g., in an ELISA and/or T-cell reactivity assay). Such immunogenic
portions may react within such assays at a level that is similar to or greater than the
reactivity of the full length polypeptide. Such screens may generally be performed using
methods well known to those of ordinary skill in the art, such as those described in Harlow
and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. For
example, a polypeptide may be immobilized on a solid support and contacted with patient
sera to allow binding of antibodies within the sera to the immobilized polypeptide.
Unbound sera may then be removed and bound antibodies detected using, for example,
125I-labeled Protein A.

As noted above, a composition may comprise a variant of a native colon
tumor protein. A polypeptide "variant," as used herein, is a polypeptide that differs from a
native colon tumor protein in one or more substitutions, deletions, additions and/or
insertions, such that the immunogenicity of the polypeptide is not substantially diminished.
In other words, the ability of a variant to react with antigen-specific antisera may be
enhanced or unchanged, relative to the native protein, or may be diminished by less than
50%, and preferably less than 20%, relative to the native protein. Such variants may
generally be identified by modifying one of the above polypeptide sequences and
evaluating the reactivity of the modified polypeptide with antigen-specific antibodies or
antisera as described herein. Preferred variants include those in which one or more
portions, such as an N-terminal leader sequence or transmembrane domain, have been
removed. Other preferred variants include variants in which a small portion (e.g., 1-30
amino acids, preferably 5-15 amino acids) has been removed from the N- and/or C-terminus
of the mature protein.
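
The reactivity thresholds stated above may be expressed as a short Python sketch. It
assumes that reactivity of the variant and of the native protein has been measured in
the same assay and in the same units (for example, background-corrected ELISA
signal); the function names, category labels and example values are hypothetical.

```python
def reactivity_loss(variant_signal, native_signal):
    """Fractional loss of reactivity relative to the native protein.

    Enhanced or unchanged reactivity is reported as a loss of 0.0.
    """
    if native_signal <= 0:
        raise ValueError("native reactivity must be positive")
    return max(0.0, (native_signal - variant_signal) / native_signal)


def classify_variant(variant_signal, native_signal):
    """Apply the 'less than 50%, preferably less than 20%' criteria."""
    loss = reactivity_loss(variant_signal, native_signal)
    if loss < 0.20:
        return "preferred"
    if loss < 0.50:
        return "acceptable"
    return "excluded"


# Hypothetical assay readings (arbitrary units).
for variant, native in ((1.5, 1.0), (0.7, 1.0), (0.4, 1.0)):
    print(variant, native, classify_variant(variant, native))
```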

Polypeptide variants preferably exhibit at least about 70%, more preferably
at least about 90% and most preferably at least about 95% identity (determined as described
above) to the identified polypeptides.

Preferably, a variant contains conservative substitutions. A "conservative
substitution" is one in which an amino acid is substituted for another amino acid that has
similar properties, such that one skilled in the art of peptide chemistry would expect the
secondary structure and hydropathic nature of the polypeptide to be substantially
unchanged. Amino acid substitutions may generally be made on the basis of similarity in
polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic nature of
the residues. For example, negatively charged amino acids include aspartic acid and
glutamic acid; positively charged amino acids include lysine and arginine; and amino acids
with uncharged polar head groups having similar hydrophilicity values include leucine,
isoleucine and valine; glycine and alanine; asparagine and glutamine; and serine, threonine,
phenylalanine and tyrosine. Other groups of amino acids that may represent conservative
changes include: (1) ala, pro, gly, glu, asp, gln, asn, ser, thr; (2) cys, ser, tyr, thr; (3) val,
ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, his. A variant may also, or
alternatively, contain non-conservative changes. In a preferred embodiment, variant
polypeptides differ from a native sequence by substitution, deletion or addition of five
amino acids or fewer. Variants may also (or alternatively) be modified by, for example, the
deletion or addition of amino acids that have minimal influence on the immunogenicity,
secondary structure and hydropathic nature of the polypeptide.
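
The five numbered groups of amino acids listed above may be encoded in a short Python
sketch that tests whether a given substitution falls within at least one group. The
one-letter codes, the function names and the equal-length comparison are illustrative
assumptions; the sketch does not assess immunogenicity.

```python
# Groups (1) to (5) from the text, written with one-letter residue codes.
CONSERVATIVE_GROUPS = (
    set("APGEDQNST"),  # (1) ala, pro, gly, glu, asp, gln, asn, ser, thr
    set("CSYT"),       # (2) cys, ser, tyr, thr
    set("VILMAF"),     # (3) val, ile, leu, met, ala, phe
    set("KRH"),        # (4) lys, arg, his
    set("FYWH"),       # (5) phe, tyr, trp, his
)


def is_conservative(original, replacement):
    """True when both residues share at least one of the listed groups."""
    return any(original in g and replacement in g for g in CONSERVATIVE_GROUPS)


def substitutions(native, variant):
    """List (position, native residue, variant residue) for equal-length sequences."""
    if len(native) != len(variant):
        raise ValueError("this sketch compares equal-length sequences only")
    return [(i + 1, a, b) for i, (a, b) in enumerate(zip(native, variant)) if a != b]


# Hypothetical native and variant peptides.
for pos, a, b in substitutions("MKLSDW", "MRISDD"):
    print(pos, a, b, is_conservative(a, b))
```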

As noted above, polypeptides may comprise a signal (or leader) sequence at
the N-terminal end of the protein which co-translationally or post-translationally directs
transfer of the protein. The polypeptide may also be conjugated to a linker or other
sequence for ease of synthesis, purification or identification of the polypeptide (e.g.,
poly-His), or to enhance binding of the polypeptide to a solid support. For example, a
polypeptide may be conjugated to an immunoglobulin Fc region.

Polypeptides may be prepared using any of a variety of well known
techniques. Recombinant polypeptides encoded by DNA sequences as described above
may be readily prepared from the DNA sequences using any of a variety of expression
vectors known to those of ordinary skill in the art. Expression may be achieved in any
appropriate host cell that has been transformed or transfected with an expression vector
containing a DNA molecule that encodes a recombinant polypeptide. Suitable host cells
include prokaryotes, yeast and higher eukaryotic cells. Preferably, the host cells employed
are E. coli, yeast or a mammalian cell line such as COS or CHO. Supernatants from
suitable host/vector systems which secrete recombinant protein or polypeptide into culture
media may be first concentrated using a commercially available filter. Following
concentration, the concentrate may be applied to a suitable purification matrix such as an
affinity matrix or an ion exchange resin. Finally, one or more reverse phase HPLC steps
can be employed to further purify a recombinant polypeptide.

Portions and other variants having fewer than about 100 amino acids, and 
generally fewer than about 50 amino acids, may also be generated by synthetic means, 
using techniques well known to those of ordinary skill in the art. For example, such 
polypeptides may be synthesized using any of the commercially available solid-phase
techniques, such as the Merrifield solid-phase synthesis method, where amino acids are
sequentially added to a growing amino acid chain. See Merrifield, J. Am. Chem. Soc.
85:2149-2154, 1963. Equipment for automated synthesis of polypeptides is commercially
available from suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City,
CA), and may be operated according to the manufacturer's instructions.

Within certain specific embodiments, a polypeptide may be a fusion protein
that comprises multiple polypeptides as described herein, or that comprises at least one
polypeptide as described herein and an unrelated sequence, such as a known tumor protein.
A fusion partner may, for example, assist in providing T helper epitopes (an immunological
fusion partner), preferably T helper epitopes recognized by humans, or may assist in
expressing the protein (an expression enhancer) at higher yields than the native
recombinant protein. Certain preferred fusion partners are both immunological and
expression enhancing fusion partners. Other fusion partners may be selected so as to
increase the solubility of the protein or to enable the protein to be targeted to desired
intracellular compartments. Still further fusion partners include affinity tags, which
facilitate purification of the protein.

Fusion proteins may generally be prepared using standard techniques,
including chemical conjugation. Preferably, a fusion protein is expressed as a recombinant
protein, allowing the production of increased levels, relative to a non-fused protein, in an
expression system. Briefly, DNA sequences encoding the polypeptide components may be
assembled separately, and ligated into an appropriate expression vector. The 3' end of the
DNA sequence encoding one polypeptide component is ligated, with or without a peptide
linker, to the 5' end of a DNA sequence encoding the second polypeptide component so
that the reading frames of the sequences are in phase. This permits translation into a single
fusion protein that retains the biological activity of both component polypeptides.
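
The requirement that the reading frames be in phase may be illustrated with the
following Python sketch, which joins two coding sequences through an optional linker
and checks that no stop codon interrupts the fused frame. The sequences are
hypothetical, are assumed to be upper-case coding-strand DNA beginning in frame, and
the function names are not part of the disclosure.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(seq):
    """Split a sequence into consecutive three-base codons."""
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def fuse_coding_sequences(first, second, linker=""):
    """Join `first`, an optional `linker`, and `second` in a single frame."""
    for name, seq in (("first", first), ("linker", linker), ("second", second)):
        if len(seq) % 3 != 0:
            raise ValueError(f"{name} sequence length is not a multiple of three")

    # Remove a terminal stop codon from the upstream component so that
    # translation continues into the linker and second component.
    if first[-3:] in STOP_CODONS:
        first = first[:-3]

    fused = first + linker + second
    # A stop codon anywhere before the final codon would truncate the fusion.
    if any(codon in STOP_CODONS for codon in codons(fused)[:-1]):
        raise ValueError("in-frame stop codon inside the fusion")
    return fused


# Hypothetical components: Met-Ala, a Gly-Ser linker, and Met-Lys-stop.
print(fuse_coding_sequences("ATGGCCTAA", "ATGAAATGA", linker="GGCTCC"))
```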

A peptide linker sequence may be employed to separate the first and the 
second polypeptide components by a distance sufficient to ensure that each polypeptide 
folds into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion protein using standard techniques well known in the art.
Suitable peptide linker sequences may be chosen based on the following factors: (1) their
ability to adopt a flexible extended conformation; (2) their inability to adopt a secondary
structure that could interact with functional epitopes on the first and second polypeptides;
and (3) the lack of hydrophobic or charged residues that might react with the polypeptide
functional epitopes. Preferred peptide linker sequences contain Gly, Asn and Ser residues.
Other near neutral amino acids, such as Thr and Ala, may also be used in the linker
sequence. Amino acid sequences which may be usefully employed as linkers include those
disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al., Proc. Natl. Acad. Sci. USA
83:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. Patent No. 4,751,180. The linker
sequence may generally be from 1 to about 50 amino acids in length. Linker sequences are 
not required when the first and second polypeptides have non-essential N-terminal amino 
acid regions that can be used to separate the functional domains and prevent steric 
interference. 
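
The compositional and length preferences for a peptide linker may be checked with a
short Python sketch. The residue sets below reflect only the residues named above
(Gly, Asn, Ser, Thr and Ala), the 50-residue limit is treated as a hard cut-off for
simplicity, and the function name and example linkers are hypothetical. Conformational
properties are not evaluated.

```python
PREFERRED_RESIDUES = set("GNS")   # Gly, Asn, Ser
NEAR_NEUTRAL_RESIDUES = set("TA")  # Thr, Ala


def linker_is_suitable(linker, max_len=50):
    """Check length (1 to about 50) and residue composition of a linker."""
    if not 1 <= len(linker) <= max_len:
        return False
    allowed = PREFERRED_RESIDUES | NEAR_NEUTRAL_RESIDUES
    return set(linker.upper()) <= allowed


# Hypothetical linkers in one-letter code.
for candidate in ("GGSGGS", "GGKGG", ""):
    print(repr(candidate), linker_is_suitable(candidate))
```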

The ligated DNA sequences are operably linked to suitable transcriptional or 
translational regulatory elements. The regulatory elements responsible for expression of 
DNA are located only 5' to the DNA sequence encoding the first polypeptide. Similarly,
stop codons required to end translation and transcription termination signals are only
present 3' to the DNA sequence encoding the second polypeptide.

Fusion proteins are also provided that comprise a polypeptide of the present 
invention together with an unrelated immunogenic protein. Preferably the immunogenic 
protein is capable of eliciting a recall response. Examples of such proteins include tetanus, 
tuberculosis and hepatitis proteins (see, for example, Stoute et al., New Engl. J. Med.
336:86-91, 1997).

Within preferred embodiments, an immunological fusion partner is derived 
from protein D, a surface protein of the gram-negative bacterium Haemophilus influenza B 
(WO 91/18926). Preferably, a protein D derivative comprises approximately the first third 
of the protein (e.g., the first N-terminal 100-110 amino acids), and a protein D derivative 
may be lipidated. Within certain preferred embodiments, the first 109 residues of a
Lipoprotein D fusion partner are included on the N-terminus to provide the polypeptide with
additional exogenous T-cell epitopes and to increase the expression level in E. coli (thus
functioning as an expression enhancer). The lipid tail ensures optimal presentation of the
antigen to antigen presenting cells. Other fusion partners include the non-structural protein
from influenza virus, NS1 (hemagglutinin). Typically, the N-terminal 81 amino acids are
used, although different fragments that include T-helper epitopes may be used.

In another embodiment, the immunological fusion partner is the protein
known as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is derived
from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine amidase known
as amidase LYTA (encoded by the LytA gene; Gene 43:265-292, 1986). LYTA is an
autolysin that specifically degrades certain bonds in the peptidoglycan backbone. The
C-terminal domain of the LYTA protein is responsible for the affinity to choline or to
some choline analogues such as DEAE. This property has been exploited for the
development of E. coli C-LYTA expressing plasmids useful for expression of fusion
proteins. Purification of hybrid proteins containing the C-LYTA fragment at the amino
terminus has been described (see Biotechnology 10:795-798, 1992). Within a preferred
embodiment, a repeat portion of LYTA may be incorporated into a fusion protein. A repeat
portion is found in the C-terminal region starting at residue 178. A particularly preferred
repeat portion incorporates residues 188-305.

In general, polypeptides (including fusion proteins) and polynucleotides as
described herein are isolated. An "isolated" polypeptide or polynucleotide is one that is
removed from its original environment. For example, a naturally-occurring protein is
isolated if it is separated from some or all of the coexisting materials in the natural system.
Preferably, such polypeptides are at least about 90% pure, more preferably at least about
95% pure and most preferably at least about 99% pure. A polynucleotide is considered to
be isolated if, for example, it is cloned into a vector that is not a part of the natural
environment.

Binding Agents

The present invention further provides agents, such as antibodies and 
antigen-binding fragments thereof, that specifically bind to a colon tumor protein. As used 
herein, an antibody, or antigen-binding fragment thereof, is said to "specifically bind" to a 
colon tumor protein if it reacts at a detectable level (within, for example, an ELISA) with a 
colon tumor protein, and does not react detectably with unrelated proteins under similar 
conditions. As used herein, "binding" refers to a noncovalent association between two 
separate molecules such that a complex is formed. The ability to bind may be evaluated by, 
for example, determining a binding constant for the formation of the complex. The binding 
constant is the value obtained when the concentration of the complex is divided by the 
product of the component concentrations. In general, two compounds are said to "bind," in 
the context of the present invention, when the binding constant for complex formation 
exceeds about 10³ L/mol. The binding constant may be determined using methods well
known in the art. 
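
The definition of the binding constant given above can be written out as a short
calculation. The following Python sketch is offered only as an illustration; the function
names are hypothetical, equilibrium concentrations are assumed to be supplied in mol/L, and
the threshold is taken at exactly 10³ L/mol although the text says "about".

```python
# Illustrative sketch only; names and units are assumptions, not part of the disclosure.
BINDING_THRESHOLD_L_PER_MOL = 1e3  # "about 10^3 L/mol" in the text


def binding_constant(complex_conc: float, conc_a: float, conc_b: float) -> float:
    """Return [AB] / ([A] * [B]) in L/mol for equilibrium concentrations in mol/L."""
    if complex_conc < 0 or conc_a <= 0 or conc_b <= 0:
        raise ValueError("component concentrations must be positive")
    return complex_conc / (conc_a * conc_b)


def is_binding(complex_conc: float, conc_a: float, conc_b: float) -> bool:
    """Apply the approximate threshold from the text to the computed constant."""
    return binding_constant(complex_conc, conc_a, conc_b) > BINDING_THRESHOLD_L_PER_MOL


if __name__ == "__main__":
    # 1 uM complex in equilibrium with 1 uM of each free component
    print(binding_constant(1e-6, 1e-6, 1e-6), is_binding(1e-6, 1e-6, 1e-6))
```

In practice the equilibrium concentrations would come from a measurement method known in
the art; the sketch covers only the arithmetic.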

Binding agents may be further capable of differentiating between patients 
with and without a cancer, such as colon cancer, using the representative assays provided 
herein. In other words, antibodies or other binding agents that bind to a colon tumor 
protein will generate a signal indicating the presence of a cancer in at least about 20% of 
patients with the disease, and will generate a negative signal indicating the absence of the 
disease in at least about 90% of individuals without the cancer. To determine whether a 
binding agent satisfies this requirement, biological samples (e.g., blood, sera, sputum, urine 
and/or tumor biopsies) from patients with and without a cancer (as determined using 
standard clinical tests) may be assayed as described herein for the presence of polypeptides 
that bind to the binding agent. It will be apparent that a statistically significant number of 
samples with and without the disease should be assayed. Each binding agent should satisfy 
the above criteria; however, those of ordinary skill in the art will recognize that binding 
agents may be used in combination to improve sensitivity. 
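
The two criteria stated above (a positive signal in at least about 20% of patients with the
disease, and a negative signal in at least about 90% of individuals without it) correspond
to minimum sensitivity and specificity values. The following Python sketch shows one way
such counts might be evaluated; the function name, argument names and the use of exact
cutoffs are assumptions made for illustration, and the sketch does not address whether the
number of samples is statistically significant.

```python
# Illustrative sketch only; names and exact cutoffs are assumptions.
def evaluate_binding_agent(true_pos, false_neg, true_neg, false_pos,
                           min_sensitivity=0.20, min_specificity=0.90):
    """Return (sensitivity, specificity, passes) from assay counts.

    true_pos / false_neg: patients with the cancer giving a positive / negative signal.
    true_neg / false_pos: individuals without the cancer giving a negative / positive signal.
    """
    diseased = true_pos + false_neg
    healthy = true_neg + false_pos
    if diseased == 0 or healthy == 0:
        raise ValueError("samples with and without the disease are both required")
    sensitivity = true_pos / diseased
    specificity = true_neg / healthy
    passes = sensitivity >= min_sensitivity and specificity >= min_specificity
    return sensitivity, specificity, passes


if __name__ == "__main__":
    print(evaluate_binding_agent(25, 75, 95, 5))
```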

Any agent that satisfies the above requirements may be a binding agent. For
example, a binding agent may be a ribosome, with or without a peptide component, an
RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an antibody
or an antigen-binding fragment thereof. Antibodies may be prepared by any of a variety of
techniques known to those of ordinary skill in the art. See, e.g., Harlow and Lane,
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In general,
antibodies can be produced by cell culture techniques, including the generation of
monoclonal antibodies as described herein, or via transfection of antibody genes into
suitable bacterial or mammalian cell hosts, in order to allow for the production of
recombinant antibodies. In one technique, an immunogen comprising the polypeptide is
initially injected into any of a wide variety of mammals (e.g., mice, rats, rabbits, sheep or
goats). In this step, the polypeptides of this invention may serve as the immunogen without
modification. Alternatively, particularly for relatively short polypeptides, a superior
immune response may be elicited if the polypeptide is joined to a carrier protein, such as
bovine serum albumin or keyhole limpet hemocyanin. The immunogen is injected into the
animal host, preferably according to a predetermined schedule incorporating one or more
booster immunizations, and the animals are bled periodically. Polyclonal antibodies
specific for the polypeptide may then be purified from such antisera by, for example,
affinity chromatography using the polypeptide coupled to a suitable solid support.

Monoclonal antibodies specific for an antigenic polypeptide of interest may
be prepared, for example, using the technique of Kohler and Milstein, Eur. J. Immunol.
6:511-519, 1976, and improvements thereto. Briefly, these methods involve the
preparation of immortal cell lines capable of producing antibodies having the desired
specificity (i.e., reactivity with the polypeptide of interest). Such cell lines may be
produced, for example, from spleen cells obtained from an animal immunized as described
above. The spleen cells are then immortalized by, for example, fusion with a myeloma cell
fusion partner, preferably one that is syngeneic with the immunized animal. A variety of
fusion techniques may be employed. For example, the spleen cells and myeloma cells may
be combined with a nonionic detergent for a few minutes and then plated at low density on
a selective medium that supports the growth of hybrid cells, but not myeloma cells. A
preferred selection technique uses HAT (hypoxanthine, aminopterin, thymidine) selection.
After a sufficient time, usually about 1 to 2 weeks, colonies of hybrids are observed.
Single colonies are selected and their culture supernatants tested for binding activity against
the polypeptide. Hybridomas having high reactivity and specificity are preferred.

Monoclonal antibodies may be isolated from the supernatants of growing
hybridoma colonies. In addition, various techniques may be employed to enhance the
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from the
ascites fluid or the blood. Contaminants may be removed from the antibodies by
conventional techniques, such as chromatography, gel filtration, precipitation, and
extraction. The polypeptides of this invention may be used in the purification process in,
for example, an affinity chromatography step.

Within certain embodiments, the use of antigen-binding fragments of
antibodies may be preferred. Such fragments include Fab fragments, which may be
prepared using standard techniques. Briefly, immunoglobulins may be purified from rabbit
serum by affinity chromatography on Protein A bead columns (Harlow and Lane,
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988) and digested by
papain to yield Fab and Fc fragments. The Fab and Fc fragments may be separated by
affinity chromatography on Protein A bead columns.

Monoclonal antibodies of the present invention may be coupled to one or
more therapeutic agents. Suitable agents in this regard include radionuclides,
differentiation inducers, drugs, toxins, and derivatives thereof. Preferred radionuclides
include ⁹⁰Y, ¹²³I, ¹²⁵I, ¹³¹I, ¹⁸⁶Re, ¹⁸⁸Re, ²¹¹At, and ²¹²Bi. Preferred drugs include
methotrexate, and pyrimidine and purine analogs. Preferred differentiation inducers
include phorbol esters and butyric acid. Preferred toxins include ricin, abrin, diphtheria
toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella toxin, and pokeweed
antiviral protein.

A therapeutic agent may be coupled (e.g., covalently bonded) to a suitable
monoclonal antibody either directly or indirectly (e.g., via a linker group). A direct
reaction between an agent and an antibody is possible when each possesses a substituent
capable of reacting with the other. For example, a nucleophilic group, such as an amino or
sulfhydryl group, on one may be capable of reacting with a carbonyl-containing group,
such as an anhydride or an acid halide, or with an alkyl group containing a good leaving
group (e.g., a halide) on the other.

Alternatively, it may be desirable to couple a therapeutic agent and an
antibody via a linker group. A linker group can function as a spacer to distance an antibody
from an agent in order to avoid interference with binding capabilities. A linker group can
also serve to increase the chemical reactivity of a substituent on an agent or an antibody,
and thus increase the coupling efficiency. An increase in chemical reactivity may also
facilitate the use of agents, or functional groups on agents, which otherwise would not be
possible.

It will be evident to those skilled in the art that a variety of bifunctional or
polyfunctional reagents, both homo- and hetero-functional (such as those described in the
catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker group.
Coupling may be effected, for example, through amino groups, carboxyl groups, sulfhydryl
groups or oxidized carbohydrate residues. There are numerous references describing such
methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al.

Where a therapeutic agent is more potent when free from the antibody
portion of the immunoconjugates of the present invention, it may be desirable to use a
linker group which is cleavable during or upon internalization into a cell. A number of
different cleavable linker groups have been described. The mechanisms for the
intracellular release of an agent from these linker groups include cleavage by reduction of a
disulfide bond (e.g., U.S. Patent No. 4,489,710, to Spitler), by irradiation of a photolabile
bond (e.g., U.S. Patent No. 4,625,014, to Senter et al.), by hydrolysis of derivatized amino
acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by serum
complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell et al.), and
by acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.).

It may be desirable to couple more than one agent to an antibody. In one
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In
another embodiment, more than one type of agent may be coupled to one antibody.
Regardless of the particular embodiment, immunoconjugates with more than one agent
may be prepared in a variety of ways. For example, more than one agent may be coupled
directly to an antibody molecule, or linkers which provide multiple sites for attachment can
be used. Alternatively, a carrier can be used.

A carrier may bear the agents in a variety of ways, including covalent
bonding either directly or via a linker group. Suitable carriers include proteins such as
albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides
such as aminodextran (e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may also
bear an agent by noncovalent bonding or by encapsulation, such as within a liposome
vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for radionuclide
agents include radiohalogenated small molecules and chelating compounds. For example,
U.S. Patent No. 4,735,792 discloses representative radiohalogenated small molecules and
their synthesis. A radionuclide chelate may be formed from chelating compounds that
include those containing nitrogen and sulfur atoms as the donor atoms for binding the
metal, or metal oxide, radionuclide. For example, U.S. Patent No. 4,673,562, to Davison
et al. discloses representative chelating compounds and their synthesis.

A variety of routes of administration for the antibodies and
immunoconjugates may be used. Typically, administration will be intravenous,
intramuscular, subcutaneous or in the bed of a resected tumor. It will be evident that the
precise dose of the antibody/immunoconjugate will vary depending upon the antibody
used, the antigen density on the tumor, and the rate of clearance of the antibody.

T Cells 

Immunotherapeutic compositions may also, or alternatively, comprise T
cells specific for a colon tumor protein. Such cells may generally be prepared in vitro or ex
vivo, using standard procedures. For example, T cells may be isolated from bone marrow,
peripheral blood, or a fraction of bone marrow or peripheral blood of a patient, using a
commercially available cell separation system, such as the ISOLEX™ system, available
from Nexell Therapeutics Inc., Irvine, CA. Alternatively, T cells may be derived from
related or unrelated humans, non-human mammals, cell lines or cultures.

T cells may be stimulated with a colon tumor polypeptide, polynucleotide
encoding a colon tumor polypeptide and/or an antigen presenting cell (APC) that expresses
such a polypeptide. Such stimulation is performed under conditions and for a time
sufficient to permit the generation of T cells that are specific for the polypeptide.
Preferably, a colon tumor polypeptide or polynucleotide is present within a delivery
vehicle, such as a microsphere, to facilitate the generation of specific T cells.

T cells are considered to be specific for a colon tumor polypeptide if the T
cells kill target cells coated with the polypeptide or expressing a gene encoding the
polypeptide. T cell specificity may be evaluated using any of a variety of standard
techniques. For example, within a chromium release assay or proliferation assay, a
stimulation index of more than two fold increase in lysis and/or proliferation, compared to
negative controls, indicates T cell specificity. Such assays may be performed, for example,
as described in Chen et al., Cancer Res. 54:1065-1070, 1994. Alternatively, detection of
the proliferation of T cells may be accomplished by a variety of known techniques. For
example, T cell proliferation can be detected by measuring an increased rate of DNA
synthesis (e.g., by pulse-labeling cultures of T cells with tritiated thymidine and measuring
the amount of tritiated thymidine incorporated into DNA). Contact with a colon tumor
polypeptide (100 ng/ml - 100 µg/ml, preferably 200 ng/ml - 25 µg/ml) for 3 - 7 days should
result in at least a two fold increase in proliferation of the T cells. Contact as described
above for 2-3 hours should result in activation of the T cells, as measured using standard
cytokine assays in which a two fold increase in the level of cytokine release (e.g., TNF or
IFN-γ) is indicative of T cell activation (see Coligan et al., Current Protocols in
Immunology, vol. 1, Wiley Interscience (Greene 1998)). T cells that have been activated in
response to a colon tumor polypeptide, polynucleotide or polypeptide-expressing APC may
be CD4+ and/or CD8+. Colon tumor protein-specific T cells may be expanded using
standard techniques. Within preferred embodiments, the T cells are derived from either a
patient or a related, or unrelated, donor and are administered to the patient following
stimulation and expansion.
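
The two-fold criterion described above for a chromium release or proliferation assay can be
illustrated with a short calculation. The following Python sketch is an example only; the
function names are hypothetical, the readouts are assumed to be background-corrected values
in the same units (for example, counts per minute of incorporated tritiated thymidine), and
the threshold of 2.0 is taken directly from the "more than two fold" language of the text.

```python
# Illustrative sketch only; names and the exact threshold handling are assumptions.
def stimulation_index(stimulated: float, negative_control: float) -> float:
    """Ratio of the readout with antigen to the readout of the negative control."""
    if negative_control <= 0:
        raise ValueError("negative control readout must be positive")
    return stimulated / negative_control


def indicates_specificity(stimulated: float, negative_control: float,
                          threshold: float = 2.0) -> bool:
    """True if the readout shows a more than two fold increase over the control."""
    return stimulation_index(stimulated, negative_control) > threshold


if __name__ == "__main__":
    # Hypothetical thymidine incorporation values (cpm)
    print(stimulation_index(12000.0, 4000.0), indicates_specificity(12000.0, 4000.0))
```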

For therapeutic purposes, CD4+ or CD8+ T cells that proliferate in response
to a colon tumor polypeptide, polynucleotide or APC can be expanded in number either
in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a variety
of ways. For example, the T cells can be re-exposed to a colon tumor polypeptide, or a
short peptide corresponding to an immunogenic portion of such a polypeptide, with or
without the addition of T cell growth factors, such as interleukin-2, and/or stimulator cells
that synthesize a colon tumor polypeptide. Alternatively, one or more T cells that
proliferate in the presence of a colon tumor protein can be expanded in number by cloning.
Methods for cloning cells are well known in the art, and include limiting dilution.

Pharmaceutical Compositions and Vaccines 

Within certain aspects, polypeptides, polynucleotides, T cells and/or binding
agents disclosed herein may be incorporated into pharmaceutical compositions or
immunogenic compositions (i.e., vaccines). Pharmaceutical compositions comprise one or
more such compounds and a physiologically acceptable carrier. Vaccines may comprise
one or more such compounds and an immunostimulant. An immunostimulant may be any
substance that enhances or potentiates an immune response to an exogenous antigen.
Examples of immunostimulants include adjuvants, biodegradable microspheres (e.g.,
polylactic galactide) and liposomes (into which the compound is incorporated; see e.g.,
Fullerton, U.S. Patent No. 4,235,877). Vaccine preparation is generally described in, for
example, M.F. Powell and M.J. Newman, eds., "Vaccine Design (the subunit and adjuvant
approach)," Plenum Press (NY, 1995). Pharmaceutical compositions and vaccines within
the scope of the present invention may also contain other compounds, which may be
biologically active or inactive. For example, one or more immunogenic portions of other
tumor antigens may be present, either incorporated into a fusion polypeptide or as a
separate compound, within the composition or vaccine.

A pharmaceutical composition or vaccine may contain DNA encoding one
or more of the polypeptides as described above, such that the polypeptide is generated in
situ. As noted above, the DNA may be present within any of a variety of delivery systems
known to those of ordinary skill in the art, including nucleic acid expression systems,
bacteria and viral expression systems. Numerous gene delivery techniques are well known
in the art, such as those described by Rolland, Crit. Rev. Therap. Drug Carrier Systems
15:143-198, 1998, and references cited therein. Appropriate nucleic acid expression
systems contain the necessary DNA sequences for expression in the patient (such as a
suitable promoter and terminating signal). Bacterial delivery systems involve the
administration of a bacterium (such as Bacillus Calmette-Guerin) that expresses an
immunogenic portion of the polypeptide on its cell surface or secretes such an epitope. In a
preferred embodiment, the DNA may be introduced using a viral expression system (e.g.,
vaccinia or other pox virus, retrovirus, or adenovirus), which may involve the use of a
non-pathogenic (defective), replication competent virus. Suitable systems are disclosed, for
example, in Fisher-Hoch et al., Proc. Natl. Acad. Sci. USA 86:317-321, 1989; Flexner et
al., Ann. N.Y. Acad. Sci. 569:86-103, 1989; Flexner et al., Vaccine 8:17-21, 1990; U.S.
Patent Nos. 4,603,112, 4,769,330, and 5,017,487; WO 89/01973; U.S. Patent
No. 4,777,127; GB 2,200,651; EP 0,345,242; WO 91/02805; Berkner, Biotechniques
6:616-627, 1988; Rosenfeld et al., Science 252:431-434, 1991; Kolls et al., Proc. Natl.
Acad. Sci. USA 91:215-219, 1994; Kass-Eisler et al., Proc. Natl. Acad. Sci. USA
90:11498-11502, 1993; Guzman et al., Circulation 88:2838-2848, 1993; and Guzman et
al., Cir. Res. 73:1202-1207, 1993. Techniques for incorporating DNA into such expression
systems are well known to those of ordinary skill in the art. The DNA may also be
"naked," as described, for example, in Ulmer et al., Science 259:1745-1749, 1993 and
reviewed by Cohen, Science 259:1691-1692, 1993. The uptake of naked DNA may be
increased by coating the DNA onto biodegradable beads, which are efficiently transported
into the cells.

While any suitable carrier known to those of ordinary skill in the art may be
employed in the pharmaceutical compositions of this invention, the type of carrier will vary
depending on the mode of administration. Compositions of the present invention may be
formulated for any appropriate manner of administration, including for example, topical,
oral, nasal, intravenous, intracranial, intraperitoneal, subcutaneous or intramuscular
administration. For parenteral administration, such as subcutaneous injection, the carrier
preferably comprises water, saline, alcohol, a fat, a wax or a buffer. For oral
administration, any of the above carriers or a solid carrier, such as mannitol, lactose, starch,
magnesium stearate, sodium saccharine, talcum, cellulose, glucose, sucrose, and
magnesium carbonate, may be employed. Biodegradable microspheres (e.g., polylactate
polyglycolate) may also be employed as carriers for the pharmaceutical compositions of
this invention. Suitable biodegradable microspheres are disclosed, for example, in U.S.
Patent Nos. 4,897,268 and 5,075,109.

Such compositions may also comprise buffers (e.g., neutral buffered saline 
or phosphate buffered saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), 
mannitol, proteins, polypeptides or amino acids such as glycine, antioxidants, chelating 
agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide) and/or 
preservatives. Alternatively, compositions of the present invention may be formulated as a 
lyophilizate. Compounds may also be encapsulated within liposomes using well known 
technology. 

Any of a variety of immunostimulants may be employed in the vaccines of 
this invention. For example, an adjuvant may be included. Most adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum 
hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A, Bordetella
pertussis or Mycobacterium tuberculosis derived proteins. Suitable adjuvants are
commercially available as, for example, Freund's Incomplete Adjuvant and Complete 
Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 65 (Merck and Company, 
Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, PA); aluminum salts such as 
aluminum hydroxide gel (alum) or aluminum phosphate; salts of calcium, iron or zinc; an 
insoluble suspension of acylated tyrosine; acylated sugars; cationically or anionically 
derivatized polysaccharides; polyphosphazenes; biodegradable microspheres; 
monophosphoryl lipid A and quil A. Cytokines, such as GM-CSF or interleukin-2, -7, or
-12, may also be used as adjuvants.

Within the vaccines provided herein, the adjuvant composition is preferably
designed to induce an immune response predominantly of the Th1 type. High levels of
Th1-type cytokines (e.g., IFN-γ, TNFα, IL-2 and IL-12) tend to favor the induction of
cell-mediated immune responses to an administered antigen. In contrast, high levels of
Th2-type cytokines (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the induction of humoral
immune responses. Following application of a vaccine as provided herein, a patient will
support an immune response that includes Th1- and Th2-type responses. Within a
preferred embodiment, in which a response is predominantly Th1-type, the level of
Th1-type cytokines will increase to a greater extent than the level of Th2-type cytokines. The
levels of these cytokines may be readily assessed using standard assays. For a review of
the families of cytokines, see Mosmann and Coffman, Ann. Rev. Immunol. 7:145-173,
1989.
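
The comparison described above, in which Th1-type cytokine levels increase to a greater
extent than Th2-type cytokine levels, may be illustrated as follows. This Python sketch is
an example only: the function names, the cytokine labels used as dictionary keys, and the
choice to summarize each group by the mean fold change between pre- and post-vaccination
measurements are all assumptions, since the text does not prescribe a particular summary
statistic.

```python
# Illustrative sketch only; grouping follows the examples in the text, the
# mean-fold-change summary is an assumption.
TH1_CYTOKINES = ("IFN-gamma", "TNF-alpha", "IL-2", "IL-12")
TH2_CYTOKINES = ("IL-4", "IL-5", "IL-6", "IL-10")


def mean_fold_change(pre: dict, post: dict, names: tuple) -> float:
    """Average post/pre ratio over the cytokines in names measured at both time points."""
    ratios = [post[n] / pre[n] for n in names if n in pre and n in post]
    if not ratios:
        raise ValueError("no cytokines from this group were measured")
    return sum(ratios) / len(ratios)


def is_th1_predominant(pre: dict, post: dict) -> bool:
    """True if Th1-type cytokines rose more, on average, than Th2-type cytokines."""
    th1 = mean_fold_change(pre, post, TH1_CYTOKINES)
    th2 = mean_fold_change(pre, post, TH2_CYTOKINES)
    return th1 > th2


if __name__ == "__main__":
    # Hypothetical cytokine levels in arbitrary but consistent units
    before = {"IFN-gamma": 10.0, "IL-2": 5.0, "IL-4": 8.0, "IL-10": 4.0}
    after = {"IFN-gamma": 50.0, "IL-2": 20.0, "IL-4": 12.0, "IL-10": 6.0}
    print(is_th1_predominant(before, after))
```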

Preferred adjuvants for use in eliciting a predominantly Th1-type response
include, for example, a combination of monophosphoryl lipid A, preferably
3-de-O-acylated monophosphoryl lipid A (3D-MPL), together with an aluminum salt. MPL
adjuvants are available from Corixa Corp. (Seattle, WA) (see US Patent Nos. 4,436,727;
4,877,611; 4,866,034 and 4,912,094). CpG-containing oligonucleotides (in which the CpG
dinucleotide is unmethylated) also induce a predominantly Th1 response. Such
oligonucleotides are well known and are described, for example, in WO 96/02555 and WO
99/33488. Immunostimulatory DNA sequences are also described, for example, by Sato et
al., Science 273:352, 1996. Another preferred adjuvant is a saponin, preferably QS21
(Aquila Biopharmaceuticals Inc., Framingham, MA), which may be used alone or in
combination with other adjuvants. For example, an enhanced system involves the
combination of a monophosphoryl lipid A and saponin derivative, such as the combination
of QS21 and 3D-MPL as described in WO 94/00153, or a less reactogenic composition
where the QS21 is quenched with cholesterol, as described in WO 96/33739. Other
preferred formulations comprise an oil-in-water emulsion and tocopherol. A particularly
potent adjuvant formulation involving QS21, 3D-MPL and tocopherol in an oil-in-water
emulsion is described in WO 95/17210.

Other preferred adjuvants include Montanide ISA 720 (Seppic, France),
SAF (Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS
series of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham,
Rixensart, Belgium), Detox (Ribi ImmunoChem Research Inc., Hamilton, MT), RC-529
(Corixa, Seattle, WA) and aminoalkyl glucosaminide 4-phosphates (AGPs).

Any vaccine provided herein may be prepared using well known methods
that result in a combination of antigen, immune response enhancer and a suitable carrier or
excipient. The compositions described herein may be administered as part of a sustained
release formulation (i.e., a formulation such as a capsule, sponge or gel (composed of
polysaccharides, for example) that effects a slow release of compound following
administration). Such formulations may generally be prepared using well known
technology (see, e.g., Coombes et al., Vaccine 14:1429-1438, 1996) and administered by,
for example, oral, rectal or subcutaneous implantation, or by implantation at the desired
target site. Sustained-release formulations may contain a polypeptide, polynucleotide or
antibody dispersed in a carrier matrix and/or contained within a reservoir surrounded by a
rate controlling membrane.

Carriers for use within such formulations are biocompatible, and may also
be biodegradable; preferably the formulation provides a relatively constant level of active
component release. Such carriers include microparticles of poly(lactide-co-glycolide), as
well as polyacrylate, latex, starch, cellulose and dextran. Other delayed-release carriers
include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e.g., a
cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer
comprising an amphiphilic compound, such as a phospholipid (see e.g., U.S. Patent No.
5,151,254 and PCT applications WO 94/20078, WO 94/23701 and WO 96/06638). The
amount of active compound contained within a sustained release formulation depends upon
the site of implantation, the rate and expected duration of release and the nature of the
condition to be treated or prevented.

Any of a variety of delivery vehicles may be employed within
pharmaceutical compositions and vaccines to facilitate production of an antigen-specific
immune response that targets tumor cells. Delivery vehicles include antigen presenting
cells (APCs), such as dendritic cells, macrophages, B cells, monocytes and other cells that
may be engineered to be efficient APCs. Such cells may, but need not, be genetically
modified to increase the capacity for presenting the antigen, to improve activation and/or
maintenance of the T cell response, to have anti-tumor effects per se and/or to be
immunologically compatible with the receiver (i.e., matched HLA haplotype). APCs may
generally be isolated from any of a variety of biological fluids and organs, including tumor
and peritumoral tissues, and may be autologous, allogeneic, syngeneic or xenogeneic cells.

Certain preferred embodiments of the present invention use dendritic cells or
progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent APCs
(Banchereau and Steinman, Nature 392:245-251, 1998) and have been shown to be
effective as a physiological adjuvant for eliciting prophylactic or therapeutic antitumor
immunity (see Timmerman and Levy, Ann. Rev. Med. 50:507-529, 1999). In general,
dendritic cells may be identified based on their typical shape (stellate in situ, with marked
cytoplasmic processes (dendrites) visible in vitro), their ability to take up, process and
present antigens with high efficiency, and their ability to activate naive T cell responses.
Dendritic cells may, of course, be engineered to express specific cell-surface receptors or
ligands that are not commonly found on dendritic cells in vivo or ex vivo, and such
modified dendritic cells are contemplated by the present invention. As an alternative to
dendritic cells, vesicles secreted by antigen-loaded dendritic cells (called exosomes) may be
used within a vaccine (see Zitvogel et al., Nature Med. 4:594-600, 1998).

Dendritic cells and progenitors may be obtained from peripheral blood, bone
marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph nodes, spleen,
skin, umbilical cord blood or any other suitable tissue or fluid. For example, dendritic cells
may be differentiated ex vivo by adding a combination of cytokines such as GM-CSF, IL-4,
IL-13 and/or TNFα to cultures of monocytes harvested from peripheral blood.
Alternatively, CD34 positive cells harvested from peripheral blood, umbilical cord blood or
bone marrow may be differentiated into dendritic cells by adding to the culture medium
combinations of GM-CSF, IL-3, TNFα, CD40 ligand, LPS, flt3 ligand and/or other
compound(s) that induce differentiation, maturation and proliferation of dendritic cells.

Dendritic cells are conveniently categorized as "immature" and "mature"
cells, which allows a simple way to discriminate between two well characterized
phenotypes. However, this nomenclature should not be construed to exclude all possible
intermediate stages of differentiation. Immature dendritic cells are characterized as APCs
with a high capacity for antigen uptake and processing, which correlates with the high
expression of Fcγ receptor and mannose receptor. The mature phenotype is typically
characterized by a lower expression of these markers, but a high expression of cell surface
molecules responsible for T cell activation such as class I and class II MHC, adhesion
molecules (e.g., CD54 and CD11) and costimulatory molecules (e.g., CD40, CD80, CD86
and 4-1BB).

APCs may generally be transfected with a polynucleotide encoding a colon
tumor protein (or portion or other variant thereof) such that the colon tumor polypeptide, or
an immunogenic portion thereof, is expressed on the cell surface. Such transfection may
take place ex vivo, and a composition or vaccine comprising such transfected cells may then
be used for therapeutic purposes, as described herein. Alternatively, a gene delivery
vehicle that targets a dendritic or other antigen presenting cell may be administered to a
patient, resulting in transfection that occurs in vivo. In vivo and ex vivo transfection of
dendritic cells, for example, may generally be performed using any methods known in the
art, such as those described in WO 97/24447, or the gene gun approach described by Mahvi
et al., Immunology and Cell Biology 75:456-460, 1997. Antigen loading of dendritic cells
may be achieved by incubating dendritic cells or progenitor cells with the colon tumor
polypeptide, DNA (naked or within a plasmid vector) or RNA; or with antigen-expressing
recombinant bacteria or viruses (e.g., vaccinia, fowlpox, adenovirus or lentivirus
vectors). Prior to loading, the polypeptide may be covalently conjugated to an
immunological partner that provides T cell help (e.g., a carrier molecule). Alternatively, a
dendritic cell may be pulsed with a non-conjugated immunological partner, separately or in
the presence of the polypeptide.

Vaccines and pharmaceutical compositions may be presented in unit-dose or
multi-dose containers, such as sealed ampoules or vials. Such containers are preferably
hermetically sealed to preserve sterility of the formulation until use. In general,
formulations may be stored as suspensions, solutions or emulsions in oily or aqueous
vehicles. Alternatively, a vaccine or pharmaceutical composition may be stored in a
freeze-dried condition requiring only the addition of a sterile liquid carrier immediately
prior to use.

Cancer Therapy 

In further aspects of the present invention, the compositions described 
herein may be used for immunotherapy of cancer, such as colon cancer. Within such 
methods, pharmaceutical compositions and vaccines are typically administered to a patient. 
As used herein, a "patient" refers to any warm-blooded animal, preferably a human. A 
patient may or may not be afflicted with cancer. Accordingly, the above pharmaceutical 
compositions and vaccines may be used to prevent the development of a cancer or to treat a 
patient afflicted with a cancer. A cancer may be diagnosed using criteria generally 
accepted in the art, including the presence of a malignant tumor. Pharmaceutical 
compositions and vaccines may be administered either prior to or following surgical 
removal of primary tumors and/or treatment such as administration of radiotherapy or 
conventional chemotherapeutic drugs. 

Within certain embodiments, immunotherapy may be active 
immunotherapy, in which treatment relies on the in vivo stimulation of the endogenous host 
immune system to react against tumors with the administration of immune response- 
modifying agents (such as polypeptides and polynucleotides disclosed herein). 

Within other embodiments, immunotherapy may be passive 
immunotherapy, in which treatment involves the delivery of agents with established tumor- 
immune reactivity (such as effector cells or antibodies) that can directly or indirectly 
mediate antitumor effects and does not necessarily depend on an intact host immune 
system. Examples of effector cells include T cells as discussed above, T lymphocytes 
(such as CD8+ cytotoxic T lymphocytes and CD4+ T-helper tumor-infiltrating
lymphocytes), killer cells (such as Natural Killer cells and lymphokine-activated killer
cells), B cells and antigen-presenting cells (such as dendritic cells and macrophages)
expressing a polypeptide provided herein. T cell receptors and antibody receptors specific
for the polypeptides recited herein may be cloned, expressed and transferred into other
vectors or effector cells for adoptive immunotherapy. The polypeptides provided herein 
may also be used to generate antibodies or anti-idiotypic antibodies (as described above 
and in U.S. Patent No. 4,918,164) for passive immunotherapy. 

Effector cells may generally be obtained in sufficient quantities for adoptive
immunotherapy by growth in vitro, as described herein. Culture conditions for expanding
single antigen-specific effector cells to several billion in number with retention of antigen
recognition in vivo are well known in the art. Such in vitro culture conditions typically use
intermittent stimulation with antigen, often in the presence of cytokines (such as IL-2) and
non-dividing feeder cells. As noted above, immunoreactive polypeptides as provided
herein may be used to rapidly expand antigen-specific T cell cultures in order to generate a
sufficient number of cells for immunotherapy. In particular, antigen-presenting cells, such
as dendritic, macrophage, monocyte, fibroblast and/or B cells, may be pulsed with
immunoreactive polypeptides or transfected with one or more polynucleotides using
standard techniques well known in the art. For example, antigen-presenting cells can be
transfected with a polynucleotide having a promoter appropriate for increasing expression
in a recombinant virus or other expression system. Cultured effector cells for use in
therapy must be able to grow and distribute widely, and to survive long term in vivo.
Studies have shown that cultured effector cells can be induced to grow in vivo and to
survive long term in substantial numbers by repeated stimulation with antigen
supplemented with IL-2 (see, for example, Cheever et al., Immunological Reviews 157:177,
1997).

Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting cells taken from a patient and clonally propagated ex 
vivo for transplant back into the same patient. Transfected cells may be reintroduced into 
the patient using any means known in the art, preferably in sterile form by intravenous,
intracavitary, intraperitoneal or intratumor administration. 

Routes and frequency of administration of the therapeutic compositions
disclosed herein, as well as dosage, will vary from individual to individual, and may be
readily established using standard techniques. In general, the pharmaceutical compositions
and vaccines may be administered by injection (e.g., intracutaneous, intramuscular,
intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. Preferably,
between 1 and 10 doses may be administered over a 52 week period. Preferably, 6 doses
are administered, at intervals of 1 month, and booster vaccinations may be given
periodically thereafter. Alternate protocols may be appropriate for individual patients. A
suitable dose is an amount of a compound that, when administered as described above, is
capable of promoting an anti-tumor immune response, and is at least 10-50% above the
basal (i.e., untreated) level. Such response can be monitored by measuring the anti-tumor
antibodies in a patient or by vaccine-dependent generation of cytolytic effector cells
capable of killing the patient's tumor cells in vitro. Such vaccines should also be capable
of causing an immune response that leads to an improved clinical outcome (e.g., more
frequent remissions, complete or partial, or longer disease-free survival) in vaccinated
patients as compared to non-vaccinated patients. In general, for pharmaceutical
compositions and vaccines comprising one or more polypeptides, the amount of each
polypeptide present in a dose ranges from about 25 µg to 5 mg per kg of host. Suitable
dose sizes will vary with the size of the patient, but will typically range from about 0.1 mL
to about 5 mL.

In general, an appropriate dosage and treatment regimen provides the active
compound(s) in an amount sufficient to provide therapeutic and/or prophylactic benefit.
Such a response can be monitored by establishing an improved clinical outcome (e.g., more
frequent remissions, complete or partial, or longer disease-free survival) in treated patients
as compared to non-treated patients. Increases in preexisting immune responses to a colon
tumor protein generally correlate with an improved clinical outcome. Such immune
responses may generally be evaluated using standard proliferation, cytotoxicity or cytokine
assays, which may be performed using samples obtained from a patient before and after
treatment.

Methods for Detecting Cancer 

In general, a cancer may be detected in a patient based on the presence of
one or more colon tumor proteins and/or polynucleotides encoding such proteins in a
biological sample (for example, blood, sera, sputum, urine and/or tumor biopsies) obtained
from the patient. In other words, such proteins may be used as markers to indicate the
presence or absence of a cancer such as colon cancer. In addition, such proteins may be
useful for the detection of other cancers. The binding agents provided herein generally
permit detection of the level of antigen that binds to the agent in the biological sample.
Polynucleotide primers and probes may be used to detect the level of mRNA encoding a
tumor protein, which is also indicative of the presence or absence of a cancer. In general, a
colon tumor sequence should be present at a level that is at least three fold higher in tumor
tissue than in normal tissue.

There are a variety of assay formats known to those of ordinary skill in the 
art for using a binding agent to detect polypeptide markers in a sample. See, e.g., Harlow 
and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In 
general, the presence or absence of a cancer in a patient may be determined by (a)
contacting a biological sample obtained from a patient with a binding agent; (b) detecting
in the sample a level of polypeptide that binds to the binding agent; and (c) comparing the
level of polypeptide with a predetermined cut-off value.

In a preferred embodiment, the assay involves the use of binding agent
immobilized on a solid support to bind to and remove the polypeptide from the remainder
of the sample. The bound polypeptide may then be detected using a detection reagent that
contains a reporter group and specifically binds to the binding agent/polypeptide complex.
Such detection reagents may comprise, for example, a binding agent that specifically binds
to the polypeptide or an antibody or other agent that specifically binds to the binding agent,
such as an anti-immunoglobulin, protein G, protein A or a lectin. Alternatively, a
competitive assay may be utilized, in which a polypeptide is labeled with a reporter group
and allowed to bind to the immobilized binding agent after incubation of the binding agent
with the sample. The extent to which components of the sample inhibit the binding of the
labeled polypeptide to the binding agent is indicative of the reactivity of the sample with
the immobilized binding agent. Suitable polypeptides for use within such assays include
full length colon tumor proteins and portions thereof to which the binding agent binds, as
described above.

The solid support may be any material known to those of ordinary skill in
the art to which the tumor protein may be attached. For example, the solid support may be
a test well in a microtiter plate or a nitrocellulose or other suitable membrane.
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a plastic
material such as polystyrene or polyvinylchloride. The support may also be a magnetic
particle or a fiber optic sensor, such as those disclosed, for example, in U.S. Patent
No. 5,359,681. The binding agent may be immobilized on the solid support using a variety
of techniques known to those of skill in the art, which are amply described in the patent and
scientific literature. In the context of the present invention, the term "immobilization"
refers to both noncovalent association, such as adsorption, and covalent attachment (which
may be a direct linkage between the agent and functional groups on the support or may be a
linkage by way of a cross-linking agent). Immobilization by adsorption to a well in a
microtiter plate or to a membrane is preferred. In such cases, adsorption may be achieved
by contacting the binding agent, in a suitable buffer, with the solid support for a suitable
amount of time. The contact time varies with temperature, but is typically between about 1
hour and about 1 day. In general, contacting a well of a plastic microtiter plate (such as
polystyrene or polyvinylchloride) with an amount of binding agent ranging from about
10 ng to about 10 µg, and preferably about 100 ng to about 1 µg, is sufficient to immobilize
an adequate amount of binding agent.

Covalent attachment of binding agent to a solid support may generally be
achieved by first reacting the support with a bifunctional reagent that will react with both
the support and a functional group, such as a hydroxyl or amino group, on the binding
agent. For example, the binding agent may be covalently attached to supports having an
appropriate polymer coating using benzoquinone or by condensation of an aldehyde group
on the support with an amine and an active hydrogen on the binding partner (see, e.g.,
Pierce Immunotechnology Catalog and Handbook, 1991, at A12-A13).

In certain embodiments, the assay is a two-antibody sandwich assay. This 
assay may be performed by first contacting an antibody that has been immobilized on a 
solid support, commonly the well of a microtiter plate, with the sample, such that 
polypeptides within the sample are allowed to bind to the immobilized antibody. Unbound 
sample is then removed from the immobilized polypeptide-antibody complexes and a 
detection reagent (preferably a second antibody capable of binding to a different site on the 
polypeptide) containing a reporter group is added. The amount of detection reagent that 
remains bound to the solid support is then determined using a method appropriate for the 
specific reporter group. 

More specifically, once the antibody is immobilized on the support as 
described above, the remaining protein binding sites on the support are typically blocked. 
Any suitable blocking agent known to those of ordinary skill in the art, such as bovine
serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO), may be used. The
immobilized antibody is then incubated with the sample, and polypeptide is allowed to
bind to the antibody. The sample may be diluted with a suitable diluent, such as
phosphate-buffered saline (PBS), prior to incubation. In general, an appropriate contact
time (i.e., incubation
time) is a period of time that is sufficient to detect the presence of polypeptide within a 
sample obtained from an individual with colon cancer. Preferably, the contact time is 
sufficient to achieve a level of binding that is at least about 95% of that achieved at 
equilibrium between bound and unbound polypeptide. Those of ordinary skill in the art 
will recognize that the time necessary to achieve equilibrium may be readily determined by 
assaying the level of binding that occurs over a period of time. At room temperature, an 
incubation time of about 30 minutes is generally sufficient. 
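
The time-course approach described above can be sketched in a few lines of Python. This is
an illustration only: the function name, the example readings, and the assumption that the
last time point approximates equilibrium binding are hypothetical and are not taken from
the specification.

```python
def time_to_fraction_of_equilibrium(times_min, bound_signal, fraction=0.95):
    """Return the earliest sampled time at which binding reaches the given
    fraction of the plateau.

    Assumption: the final measurement is taken late enough to approximate
    the equilibrium level of binding.
    """
    if not times_min or len(times_min) != len(bound_signal):
        raise ValueError("times and signals must be non-empty and equal in length")
    plateau = bound_signal[-1]
    for t, signal in zip(times_min, bound_signal):
        if signal >= fraction * plateau:
            return t
    return times_min[-1]


# Hypothetical time course (minutes, arbitrary signal units).
times = [5, 10, 20, 30, 60, 120]
signals = [0.40, 0.65, 0.88, 0.96, 0.99, 1.00]
contact_time = time_to_fraction_of_equilibrium(times, signals)
```

With these made-up readings the function returns the 30 minute time point, which is the
first sample at or above 95% of the plateau.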

Unbound sample may then be removed by washing the solid support with an 
appropriate buffer, such as PBS containing 0.1% Tween 20™. The second antibody, which 
contains a reporter group, may then be added to the solid support. Preferred reporter
groups include those groups recited above. 

The detection reagent is then incubated with the immobilized
antibody-polypeptide complex for an amount of time sufficient to detect the bound
polypeptide. An appropriate amount of time may generally be determined by assaying the
level of binding that occurs over a period of time. Unbound detection reagent is then
removed and bound detection reagent is detected using the reporter group. The method
employed for detecting the reporter group depends upon the nature of the reporter group.
For radioactive groups, scintillation counting or autoradiographic methods are generally
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups and
fluorescent groups. Biotin may be detected using avidin, coupled to a different reporter
group (commonly a radioactive or fluorescent group or an enzyme). Enzyme reporter
groups may generally be detected by the addition of substrate (generally for a specific
period of time), followed by spectroscopic or other analysis of the reaction products.

To determine the presence or absence of a cancer, such as colon cancer, the
signal detected from the reporter group that remains bound to the solid support is generally
compared to a signal that corresponds to a predetermined cut-off value. In one preferred
embodiment, the cut-off value for the detection of a cancer is the average mean signal
obtained when the immobilized antibody is incubated with samples from patients without
the cancer. In general, a sample generating a signal that is three standard deviations above
the predetermined cut-off value is considered positive for the cancer. In an alternate
preferred embodiment, the cut-off value is determined using a Receiver Operator Curve,
according to the method of Sackett et al., Clinical Epidemiology: A Basic Science for
Clinical Medicine, Little Brown and Co., 1985, p. 106-7. Briefly, in this embodiment, the
cut-off value may be determined from a plot of pairs of true positive rates (i.e., sensitivity)
and false positive rates (100%-specificity) that correspond to each possible cut-off value for
the diagnostic test result. The cut-off value on the plot that is the closest to the upper
left-hand corner (i.e., the value that encloses the largest area) is the most accurate cut-off
value, and a sample generating a signal that is higher than the cut-off value determined by
this method may be considered positive. Alternatively, the cut-off value may be shifted to the
left along the plot, to minimize the false positive rate, or to the right, to minimize the false
negative rate. In general, a sample generating a signal that is higher than the cut-off value
determined by this method is considered positive for a cancer.
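
The two cut-off strategies described above may be illustrated with a short Python sketch.
The function names and the example signal values are hypothetical, and the Euclidean
distance to the upper left-hand corner is one assumed way of making "closest" concrete; the
specification does not prescribe a particular implementation.

```python
from math import hypot
from statistics import mean, stdev


def mean_plus_3sd_threshold(control_signals):
    """Signal level three sample standard deviations above the mean signal
    obtained from samples of patients without the cancer."""
    return mean(control_signals) + 3 * stdev(control_signals)


def roc_cutoff(control_signals, cancer_signals):
    """Scan every observed signal as a candidate cut-off and return the one
    whose (false positive rate, true positive rate) point lies closest to
    the upper left-hand corner (0, 1) of the plot.

    A sample is called positive when its signal is higher than the cut-off.
    Returns (cutoff, true_positive_rate, false_positive_rate).
    """
    best = None
    for cutoff in sorted(set(control_signals) | set(cancer_signals)):
        tpr = sum(s > cutoff for s in cancer_signals) / len(cancer_signals)
        fpr = sum(s > cutoff for s in control_signals) / len(control_signals)
        distance = hypot(fpr, 1.0 - tpr)
        if best is None or distance < best[0]:
            best = (distance, cutoff, tpr, fpr)
    return best[1], best[2], best[3]


# Hypothetical absorbance readings.
controls = [0.10, 0.12, 0.11, 0.09, 0.13]
cancers = [0.30, 0.45, 0.12, 0.50, 0.38]
threshold = mean_plus_3sd_threshold(controls)
cutoff, sensitivity, false_positive_rate = roc_cutoff(controls, cancers)
```

Shifting the chosen cut-off up or down from this point trades false positives against
false negatives, as noted above.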

In a related embodiment, the assay is performed in a flow-through or strip
test format, wherein the binding agent is immobilized on a membrane, such as
nitrocellulose. In the flow-through test, polypeptides within the sample bind to the
immobilized binding agent as the sample passes through the membrane. A second, labeled
binding agent then binds to the binding agent-polypeptide complex as a solution containing
the second binding agent flows through the membrane. The detection of bound second
binding agent may then be performed as described above. In the strip test format, one end
of the membrane to which binding agent is bound is immersed in a solution containing the
sample. The sample migrates along the membrane through a region containing second
binding agent and to the area of immobilized binding agent. Concentration of second
binding agent at the area of immobilized antibody indicates the presence of a cancer.
Typically, the concentration of second binding agent at that site generates a pattern, such as
a line, that can be read visually. The absence of such a pattern indicates a negative result.
In general, the amount of binding agent immobilized on the membrane is selected to
generate a visually discernible pattern when the biological sample contains a level of
polypeptide that would be sufficient to generate a positive signal in the two-antibody
sandwich assay, in the format discussed above. Preferred binding agents for use in such
assays are antibodies and antigen-binding fragments thereof. Preferably, the amount of
antibody immobilized on the membrane ranges from about 25 ng to about 1 µg, and more
preferably from about 50 ng to about 500 ng. Such tests can typically be performed with a
very small amount of biological sample.

Of course, numerous other assay protocols exist that are suitable for use 
with the tumor proteins or binding agents of the present invention. The above descriptions 
are intended to be exemplary only. For example, it will be apparent to those of ordinary 
skill in the art that the above protocols may be readily modified to use colon tumor 
polypeptides to detect antibodies that bind to such polypeptides in a biological sample.
The detection of such colon tumor protein specific antibodies may correlate with the 
presence of a cancer. 

A cancer may also, or alternatively, be detected based on the presence of T
cells that specifically react with a colon tumor protein in a biological sample. Within
certain methods, a biological sample comprising CD4+ and/or CD8+ T cells isolated from a
patient is incubated with a colon tumor polypeptide, a polynucleotide encoding such a
polypeptide and/or an APC that expresses at least an immunogenic portion of such a
polypeptide, and the presence or absence of specific activation of the T cells is detected.
Suitable biological samples include, but are not limited to, isolated T cells. For example, T
cells may be isolated from a patient by routine techniques (such as by Ficoll/Hypaque
density gradient centrifugation of peripheral blood lymphocytes). T cells may be incubated
in vitro for 2-9 days (typically 4 days) at 37°C with one or more representative
polypeptides (e.g., 5-25 µg/ml). It may be desirable to incubate another aliquot of a T cell
sample in the absence of colon tumor polypeptide to serve as a control. For CD4+ T cells,
activation is preferably detected by evaluating proliferation of the T cells. For CD8+ T
cells, activation is preferably detected by evaluating cytolytic activity. A level of
proliferation that is at least two fold greater and/or a level of cytolytic activity that is at
least 20% greater than in disease-free patients indicates the presence of a cancer in the
patient.
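
The decision rule in the preceding paragraph can be expressed as a small Python sketch.
The function name and argument layout are hypothetical, and "20% greater" is interpreted
here as a relative increase (1.2 times the disease-free reference); that interpretation is
an assumption and is not stated in the specification.

```python
def t_cell_response_indicates_cancer(proliferation=None, cytolysis=None):
    """Each argument is a (patient_value, disease_free_reference) pair, or
    None if that assay was not run.

    Proliferation (CD4+ readout) must be at least two fold the reference.
    Cytolytic activity (CD8+ readout) must be at least 20% greater than the
    reference; assumed here to mean 1.2 times the reference value.
    """
    flags = []
    if proliferation is not None:
        patient, reference = proliferation
        flags.append(patient >= 2.0 * reference)
    if cytolysis is not None:
        patient, reference = cytolysis
        flags.append(patient >= 1.2 * reference)
    # The text uses "and/or", so either criterion alone is sufficient.
    return any(flags)
```

For example, a hypothetical proliferation reading of 5000 against a disease-free reference
of 2000 would satisfy the two fold criterion.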

As noted above, a cancer may also, or alternatively, be detected based on the 
level of mRNA encoding a colon tumor protein in a biological sample. For example, at 
least two oligonucleotide primers may be employed in a polymerase chain reaction (PCR) 
based assay to amplify a portion of a colon tumor cDNA derived from a biological sample, 
wherein at least one of the oligonucleotide primers is specific for (i.e., hybridizes to) a
polynucleotide encoding the colon tumor protein. The amplified cDNA is then separated 
and detected using techniques well known in the art, such as gel electrophoresis. Similarly, 
oligonucleotide probes that specifically hybridize to a polynucleotide encoding a colon 
tumor protein may be used in a hybridization assay to detect the presence of polynucleotide
encoding the tumor protein in a biological sample. 

To permit hybridization under assay conditions, oligonucleotide primers and
probes should comprise an oligonucleotide sequence that has at least about 60%, preferably
at least about 75% and more preferably at least about 90%, identity to a portion of a
polynucleotide encoding a colon tumor protein that is at least 10 nucleotides, and
preferably at least 20 nucleotides, in length. Preferably, oligonucleotide primers and/or
probes will hybridize to a polynucleotide encoding a polypeptide disclosed herein under
moderately stringent conditions, as defined above. Oligonucleotide primers and/or probes
which may be usefully employed in the diagnostic methods described herein preferably are
at least 10-40 nucleotides in length. In a preferred embodiment, the oligonucleotide
primers comprise at least 10 contiguous nucleotides, more preferably at least 15 contiguous
nucleotides, of a DNA molecule having a sequence recited in SEQ ID NO: 1-121, 123-197,
205-630, 632-684, 686, 690-691, and 694-1081. Techniques for both PCR based assays
and hybridization assays are well known in the art (see, for example, Mullis et al., Cold
Spring Harbor Symp. Quant. Biol., 51:263, 1987; Erlich ed., PCR Technology, Stockton
Press, NY, 1989).
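
As a rough illustration of the identity criterion above, the following Python sketch scores
an oligonucleotide against every same-length window of a target sequence. It is a
simplification under stated assumptions: the comparison is ungapped and same-strand only,
and the function names and the target sequence used in the example are hypothetical.

```python
def best_ungapped_identity(oligo, target):
    """Highest percent identity between the oligonucleotide and any
    same-length window of the target (no gaps, same strand)."""
    oligo, target = oligo.upper(), target.upper()
    n = len(oligo)
    if n == 0 or n > len(target):
        raise ValueError("oligo must be non-empty and no longer than the target")
    best = 0.0
    for start in range(len(target) - n + 1):
        window = target[start:start + n]
        matches = sum(a == b for a, b in zip(oligo, window))
        best = max(best, 100.0 * matches / n)
    return best


def meets_identity_criterion(oligo, target, min_identity=60.0, min_length=10):
    """True if the oligo spans at least min_length nucleotides and reaches
    min_identity percent identity to some portion of the target."""
    return (len(oligo) >= min_length
            and best_ungapped_identity(oligo, target) >= min_identity)


# Hypothetical target; not a sequence from the specification.
example_target = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"
exact_primer = example_target[5:25]
one_mismatch_primer = "T" + exact_primer[1:]
```

Raising `min_identity` to 75 or 90 and `min_length` to 20 corresponds to the more
preferred ranges recited above.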

One preferred assay employs RT-PCR, in which PCR is applied in
conjunction with reverse transcription. Typically, RNA is extracted from a biological
sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. PCR
amplification using at least one specific primer generates a cDNA molecule, which may be
separated and visualized using, for example, gel electrophoresis. Amplification may be
performed on biological samples taken from a test patient and from an individual who is
not afflicted with a cancer. The amplification reaction may be performed on several
dilutions of cDNA spanning two orders of magnitude. A two-fold or greater increase in
expression in several dilutions of the test patient sample as compared to the same dilutions
of the non-cancerous sample is typically considered positive.
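
The dilution-series comparison can be sketched as follows. The function name, the example
band intensities, and the reading of "several dilutions" as at least two dilutions are
assumptions made for illustration only.

```python
def rt_pcr_positive(test_signals, normal_signals, fold=2.0, min_dilutions=2):
    """Compare matched cDNA dilutions from a test patient and from a
    non-cancerous sample.

    Both arguments map a dilution factor to a measured signal. Returns
    (is_positive, dilutions_with_at_least_fold_increase).
    Assumption: "several dilutions" is taken to mean min_dilutions or more.
    """
    if test_signals.keys() != normal_signals.keys():
        raise ValueError("both samples must be measured at the same dilutions")
    elevated = [
        dilution
        for dilution, signal in test_signals.items()
        if signal > 0 and signal >= fold * normal_signals[dilution]
    ]
    return len(elevated) >= min_dilutions, sorted(elevated)


# Hypothetical band intensities at dilutions spanning two orders of magnitude.
test_patient = {1.0: 900, 0.1: 100, 0.01: 8}
non_cancerous = {1.0: 400, 0.1: 45, 0.01: 6}
result = rt_pcr_positive(test_patient, non_cancerous)
```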

In another embodiment, the disclosed compositions may be used as markers 
for the progression of cancer. In this embodiment, assays as described above for the 
diagnosis of a cancer may be performed over time, and the change in the level of reactive
polypeptide(s) or polynucleotide evaluated. For example, the assays may be performed 
every 24-72 hours for a period of 6 months to 1 year, and thereafter performed as needed. 
In general, a cancer is progressing in those patients in whom the level of polypeptide or 
polynucleotide detected increases over time. In contrast, the cancer is not progressing 
when the level of reactive polypeptide or polynucleotide either remains constant or 
decreases with time. 
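
One way to make the progression rule concrete is to fit a straight line to the measured
levels and inspect its slope. The sketch below is an assumption-laden illustration: the
least-squares fit, the `tolerance` parameter, and the function name are not part of the
specification, which only distinguishes increasing levels from constant or decreasing
levels.

```python
def classify_progression(days, levels, tolerance=0.0):
    """Least-squares slope of marker level versus time.

    Returns "progressing" when the fitted level increases over time (slope
    above the tolerance) and "not progressing" when it is constant or falls.
    """
    n = len(days)
    if n < 2 or n != len(levels):
        raise ValueError("need at least two paired measurements")
    mean_t = sum(days) / n
    mean_y = sum(levels) / n
    sxx = sum((t - mean_t) ** 2 for t in days)
    if sxx == 0:
        raise ValueError("measurements must be taken at different times")
    slope = sum((t - mean_t) * (y - mean_y) for t, y in zip(days, levels)) / sxx
    return "progressing" if slope > tolerance else "not progressing"
```

A nonzero `tolerance` could be used to avoid calling small assay-to-assay fluctuations an
increase; a suitable value would have to be established experimentally.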

Certain in vivo diagnostic assays may be performed directly on a tumor. 
One such assay involves contacting tumor cells with a binding agent. The bound binding 
agent may then be detected directly or indirectly via a reporter group. Such binding agents 
may also be used in histological applications. Alternatively, polynucleotide probes may be 
used within such applications. 

As noted above, to improve sensitivity, multiple colon tumor protein 
markers may be assayed within a given sample. It will be apparent that binding agents 
specific for different proteins provided herein may be combined within a single assay. 
Further, multiple primers or probes may be used concurrently. The selection of tumor 
protein markers may be based on routine experiments to determine combinations that
result in optimal sensitivity. In addition, or alternatively, assays for tumor proteins
provided herein may be combined with assays for other known tumor antigens. 

Diagnostic Kits 

The present invention further provides kits for use within any of the above 
diagnostic methods. Such kits typically comprise two or more components necessary for 
performing a diagnostic assay. Components may be compounds, reagents, containers 
and/or equipment. For example, one container within a kit may contain a monoclonal 
antibody or fragment thereof that specifically binds to a colon tumor protein. Such 
antibodies or fragments may be provided attached to a support material, as described 
above. One or more additional containers may enclose elements, such as reagents or 
buffers, to be used in the assay. Such kits may also, or alternatively, contain a detection 
reagent as described above that contains a reporter group suitable for direct or indirect
detection of antibody binding. 

Alternatively, a kit may be designed to detect the level of mRNA encoding a 
colon tumor protein in a biological sample. Such kits generally comprise at least one 
oligonucleotide probe or primer, as described above, that hybridizes to a polynucleotide 
encoding a colon tumor protein. Such an oligonucleotide may be used, for example, within 
a PCR or hybridization assay. Additional components that may be present within such kits 
include a second oligonucleotide and/or a diagnostic reagent or container to facilitate the 
detection of a polynucleotide encoding a colon tumor protein. 

The following Examples are offered by way of illustration and not by way
of limitation.

EXAMPLES
Example 1 

ISOLATION AND CHARACTERIZATION OF COLON TUMOR POLYPEPTIDES BY 
PCR-BASED SUBTRACTION AND MICROARRAY ANALYSIS 

A cDNA library was constructed in the pCR2.1 vector (Invitrogen,
Carlsbad, CA) by subtracting a pool of three colon tumors with a pool of normal colon,
spleen, brain, liver, kidney, lung, stomach and small intestine using PCR subtraction
methodologies (Clontech, Palo Alto, CA). The subtraction was performed using a
PCR-based protocol, which was modified to generate larger fragments. Within this protocol,
tester and driver double stranded cDNA were separately digested with five restriction
enzymes that recognize six-nucleotide restriction sites (MluI, MscI, PvuII, SalI and StuI).
This digestion resulted in an average cDNA size of 600 bp, rather than the average size of
300 bp that results from digestion with RsaI according to the Clontech protocol. This
modification did not affect the subtraction efficiency. Two tester populations were then 
created with different adapters, and the driver library remained without adapters. 

The tester and driver libraries were then hybridized using excess driver 
cDNA. In the first hybridization step, driver was separately hybridized with each of the 
two tester cDNA populations. This resulted in populations of (a) unhybridized tester 
cDNAs, (b) tester cDNAs hybridized to other tester cDNAs, (c) tester cDNAs hybridized to 
driver cDNAs, and (d) unhybridized driver cDNAs. The two separate hybridization 
reactions were then combined, and rehybridized in the presence of additional denatured 
driver cDNA. Following this second hybridization, in addition to populations (a) through 
(d), a fifth population (e) was generated in which tester cDNA with one adapter hybridized 
to tester cDNA with the second adapter. Accordingly, the second hybridization step 
resulted in enrichment of differentially expressed sequences which could be used as 
templates for PCR amplification with adaptor-specific primers. 

The ends were then filled in, and PCR amplification was performed using 
adaptor-specific primers. Only population (e), which contained tester cDNA that did not 
hybridize to driver cDNA, was amplified exponentially. A second PCR amplification step 
was then performed, to reduce background and further enrich differentially expressed 
sequences. 
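
The bookkeeping of populations (a) through (e) may be illustrated with a short Python sketch. The Strand type and the functions population and amplifies_exponentially are hypothetical names introduced for illustration only. Labeling a driver-driver duplex as population (d) is an assumption, since the text lists only unhybridized driver cDNAs under (d).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Strand:
    source: str             # "tester" or "driver"
    adapter: Optional[int]  # 1 or 2 for tester strands, None for driver


def population(first: Strand, second: Optional[Strand]) -> str:
    """Return the population letter (a)-(e) used in the text."""
    if second is None:
        # Unhybridized molecules.
        return "a" if first.source == "tester" else "d"
    sources = {first.source, second.source}
    if sources == {"tester", "driver"}:
        return "c"
    if sources == {"driver"}:
        return "d"  # assumption: driver-driver duplexes grouped with (d)
    # Both strands are tester: same adapter is (b), different adapters is (e).
    return "e" if first.adapter != second.adapter else "b"


def amplifies_exponentially(label: str) -> bool:
    """Per the text, only population (e) is amplified exponentially."""
    return label == "e"


hybrid = population(Strand("tester", 1), Strand("tester", 2))
print(hybrid, amplifies_exponentially(hybrid))
```

Only a duplex carrying a different adapter on each strand provides both primer binding sites, which is why the sketch singles out population (e).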

This PCR-based subtraction technique normalizes differentially expressed 
cDNAs so that rare transcripts that are over-expressed in colon tumor tissue may be 
recoverable. Such transcripts would be difficult to recover by traditional subtraction 
methods. 

To characterize the complexity and redundancy of the subtracted library, 96 
clones were randomly picked and 65 were sequenced, as previously described. These 
sequences were further characterized by comparison with the most recent GenBank 
database (April, 1998) to determine their degree of novelty. No significant homologies 
were found to 21 of these clones, hereinafter referred to as 11092, 11093, 11096, 11098, 
11103, 11174, 11108, 11112, 11115, 11117, 11118, 11134, 11151, 11154, 11158, 11168, 
11172, 11175, 11184, 11185 and 11187. The determined cDNA sequences for these clones 
are provided in SEQ ID NO: 48, 49, 52, 54, 59, 60, 65-69, 79, 89, 90, 93, 99-101 and 
109-111, respectively. 

Two thousand clones from the above-mentioned cDNA subtraction library 
were randomly picked and submitted to a round of PCR amplification. Briefly, 0.5 µl of 
glycerol stock solution was added to 99.5 µl of PCR mix (80 µl H2O, 10 µl 10X PCR 
buffer, 6 µl 25 mM MgCl2, 1 µl 10 mM dNTPs, 1 µl 100 mM M13 forward primer 
(CACGACGTTGTAAAACGACGG), 1 µl 100 mM M13 reverse primer 
(CACAGGAAACAGCTATGACC), and 0.5 µl 5 u/ml Taq polymerase; primers provided 
by Operon Technologies, Alameda, CA). The PCR amplification was run for thirty cycles 
under the following conditions: 95°C for 5 min., 92°C for 30 sec, 57°C for 40 sec, 75°C 
for 2 min. and 75°C for 5 minutes. 
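
The reaction volumes above can be checked with a simple dilution calculation. The following Python sketch is illustrative only; the names PCR_MIX_UL and final_conc are hypothetical, and counting the Taq polymerase as part of the 99.5 µl mix is an assumption (the listed volumes sum to 99.5 µl only if it is included).

```python
# Volumes in microliters, as listed in the text.
# Assumption: Taq polymerase is counted as part of the 99.5 ul PCR mix.
PCR_MIX_UL = {
    "H2O": 80.0,
    "10X PCR buffer": 10.0,
    "25 mM MgCl2": 6.0,
    "10 mM dNTPs": 1.0,
    "M13 forward primer": 1.0,
    "M13 reverse primer": 1.0,
    "Taq polymerase": 0.5,
}
TEMPLATE_UL = 0.5  # glycerol stock


def final_conc(stock, volume_ul, total_ul):
    """C1 * V1 = C2 * V2; the result is in the same unit as `stock`."""
    return stock * volume_ul / total_ul


mix_ul = sum(PCR_MIX_UL.values())
total_ul = mix_ul + TEMPLATE_UL

mgcl2_mM = final_conc(25.0, PCR_MIX_UL["25 mM MgCl2"], total_ul)
dntp_mM = final_conc(10.0, PCR_MIX_UL["10 mM dNTPs"], total_ul)
buffer_x = final_conc(10.0, PCR_MIX_UL["10X PCR buffer"], total_ul)

print(mix_ul, total_ul, mgcl2_mM, dntp_mM, buffer_x)
```

With these volumes the reaction totals 100 µl, giving final concentrations of 1.5 mM MgCl2, 0.1 mM dNTPs and 1X buffer.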

mRNA expression levels for representative clones were determined using 
microarray technology (Synteni, Palo Alto, CA) in colon tumor tissues (n=25), normal 
colon tissues (n=6), kidney, lung, liver, brain, heart, esophagus, small intestine, stomach, 
pancreas, adrenal gland, salivary gland, resting PBMC, activated PBMC, bone marrow, 
dendritic cells, spinal cord, blood vessels, skeletal muscle, skin, breast and fetal tissues. 
The number of tissue samples tested in each case was one (n=1), except where specifically 
noted above; additionally, all the above-mentioned tissues were derived from humans. The 
PCR amplification products were dotted onto slides in an array format, with each product 
occupying a unique location in the array. mRNA was extracted from the tissue sample to 
be tested, and fluorescent-labeled cDNA probes were generated by reverse transcription 
according to the protocol provided by Synteni. The microarrays were probed with the 
labeled cDNA probes, the slides scanned, and fluorescence intensity was measured. This 
intensity correlates with the hybridization intensity. 

One hundred and forty-nine clones showed two or more fold over-expression 
in the colon tumor probe group as compared to the normal tissue probe group. 
These cDNA clones were further characterized by DNA sequencing with a Perkin 
Elmer/Applied Biosystems Division Automated Sequencer Model 373A and/or Model 377 
(Foster City, CA). These sequences were compared to known sequences in the most recent 
GenBank database. No significant homologies to human gene sequences were found in 
forty-nine of these clones, represented by the following sixteen cDNA consensus 
sequences: SEQ ID NO: 2, 8, 15, 16, 22, 24, 30, 32-34, 36, 38, 40, 41, 46 and 47 
(hereinafter referred to as Contig 2, 8, 13, 14, 20, 23, 29, 31, 35, 32, 36, 38, 41, 42, 50 and 
51, respectively). Contig 29 (SEQ ID NO: 30) was found to be a Rat GSK-3β-interacting 
protein Axil homolog. Also, Contigs 31 and 35 (SEQ ID NO: 32 and 33, respectively) 
were found to be a Mus musculus GOB-4 homolog. The determined cDNA sequences of 
SEQ ID NO: 1, 3-7, 9-14, 17-21, 23, 25-29, 31, 35, 37, 39, 42-45, 50, 51, 53, 55-58, 61-64, 
70-78, 80-88, 91, 92, 94-98, 102-108 and 112 were found to show some homology to 
previously identified gene sequences. 
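
The two or more fold over-expression criterion may be sketched as a simple filter. The Python example below is a minimal illustration under stated assumptions: the comparison of group means is assumed, since the exact statistic is not specified above, and the clone names and intensity values are invented for demonstration.

```python
from statistics import mean


def fold_overexpression(tumor, normal):
    """Ratio of mean tumor intensity to mean normal-tissue intensity.

    Assumption: group means are compared; the text does not state
    which summary statistic was used.
    """
    return mean(tumor) / mean(normal)


def overexpressed_clones(intensities, threshold=2.0):
    """Return names of clones whose fold over-expression meets the threshold."""
    return [
        name
        for name, (tumor, normal) in intensities.items()
        if fold_overexpression(tumor, normal) >= threshold
    ]


# Hypothetical fluorescence intensities: (tumor probes, normal tissue probes).
example = {
    "clone_A": ([400.0, 600.0, 500.0], [200.0, 250.0, 150.0]),
    "clone_B": ([300.0, 310.0], [280.0, 320.0]),
}

selected = overexpressed_clones(example)
print(selected)
```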

Microarray analysis demonstrated that Contig 2 (SEQ ID NO: 2) showed over-expression 
in 34% of colon tumors tested, as well as increased expression in normal 
pancreatic tissue, with no over-expression in normal colon tissues. Upon further analysis, 
Contigs 2, 8 and 23 were found to share homology to the known gene GW112. Contigs 4, 
5, 9 and 52 showed homology to carcinoembryonic antigen (SEQ ID NO: 3, 4, 5 and 6, 
respectively). A representative sampling of these fragments showed over-expression in 
85% of colon tumors, with over-expression in normal bone marrow and 3/6 normal colon 
tissues. Contig 6 (SEQ ID NO: 7), showing homology to the known gene sequence for 
villin, was over-expressed in about half of all colon tumors tested, with a limited 
degree of low level over-expression in normal colon. Contig 12 (SEQ ID NO: 14), 
showing homology to Chromosome 17, clone hRPC.1171_I_10, also referred to as C798P, 
was over-expressed in approximately 70% of colon tumors tested, with low over-expression 
in 1/6 normal colon samples. Contig 14, also referred to as 14261 (SEQ ID NO: 
16), showing no significant homology to any known gene, showed over-expression in 44% 
of colon tumors tested, with low level expression in half of normal colon tissues, as well as 
small intestine and pancreatic tissue. Contig 18 (SEQ ID NO: 21), showing homology to 
the known gene for LI-cadherin, showed over-expression in approximately half of colon 
tumors and low level over-expression in 3/6 normal colon tissues tested. Contig 22 (SEQ 
ID NO: 23), showing homology to Bumetanide-sensitive Na-K-Cl cotransporter, was 
over-expressed in 70% of colon tumors, with no over-expression in any normal tissues tested. 
Contig 25 (SEQ ID NO: 25), showing homology to macrophage inflammatory protein-3α, 
was over-expressed in over 40% of colon tumors and in activated PBMC. Contigs 26 and 
48 (SEQ ID NOS: 25 and 26), showing homology to the sequence for laminin, were 
over-expressed in 48% of colon tumors, with low over-expression in stomach tissue. Contig 
28 (SEQ ID NO: 29), showing homology to the known gene sequence for Chromosome 16 
BAC clone CIT987SK-A-363E6, was over-expressed in 33% of colon tumors tested, with 
normal stomach and 2/6 normal colon tissues showing low level over-expression. Contigs 
29, 31 and 35 (SEQ ID NOS: 30, 32 and 33, respectively), also referred to as C751P, an 
unknown sequence showing limited and partial homology to Rat GSK-3β-interacting 
protein Axil homolog and Mus musculus GOB-4 homolog, were over-expressed in 74% of 
colon tumors, with no over-expression in any normal tissues tested. Contig 34 (SEQ ID NO: 
35), showing homology to the known sequence for desmoglein 2, was over-expressed in 
56% of colon tumors and showed low level over-expression in 1/6 normal colon tissues. 
Contig 36 (SEQ ID NO: 36), an unknown sequence also referred to as C793P, showed 
over-expression in 30% of colon tumor tissues tested. Contig 37 and 14287.2 (SEQ ID 
NOS: 37 and 116), an unknown sequence, but with limited (89%) homology to the known 
sequence for putative transmembrane protein, was over-expressed in 70% of colon tumors, 
as well as in normal lung tissue and 3/6 normal colon tissues tested. Contig 38, also 
referred to as C796P and 14219 (SEQ ID NO: 38), showing no significant homology to any 
known gene, was over-expressed in 38% of colon tumors, with no elevated over-expression 
in any normal tissues. Contig 41 (SEQ ID NO: 40), also referred to as C799P and 14308, 
an unknown sequence showing no significant homology to any known gene, was 
over-expressed in 22% of colon tumors. Contig 42 (SEQ ID NO: 41), also referred to as C794P 
and 14309, an unknown sequence with no significant homology to any known gene, was 
over-expressed in 63% of colon tumors tested, as well as in 3/6 normal colon tissues. 
Contig 43 (SEQ ID NO: 42), showing homology to the known sequence for Chromosome 1 
specific transcript KIAA0487, was over-expressed in 85% of colon tumors tested and in 
normal lung and 4/6 normal colon tissues. Contig 49 (SEQ ID NO: 45), showing 
homology to the known sequence for pump-1, was over-expressed in 44% of colon tumors, 
with no over-expression in any normal tissues tested. Contig 50 (SEQ ID NO: 46), also 
referred to as C792P and 18323, showing no significant homology to any known gene, was 
over-expressed in 33% of colon tumors with no detectable over-expression in any normal 
tissues tested. Contig 51 (SEQ ID NO: 47), also referred to as C795P and 14317, was 
over-expressed in 11% of colon tumors. 

Additional microarray analysis yielded seven clones showing two or more 
fold over-expression in the colon tumor probe group as compared to the normal tissue 
25 probe group. Three of these clones demonstrated particularly good colon tumor specificity, 
and are represented by SEQ ID NO: 115, 116 and 120. Specifically, SEQ ID NO: 115, 
referred to as C791P or 14235, which shows homology to the known gene sequence for H. 
sapiens chromosome 21 derived BAC containing ets-2 gene, was over-expressed in 89% of 
colon tumors tested and in 5/6 normal colon tissues, as well as over-expressed at low levels 
in normal lung and activated PBMC. Microarray analysis for SEQ ID NO: 116 is 
discussed above. SEQ ID NO: 120, referred to as 14295, showing homology to the known 
gene sequence for secreted cement gland protein XAG-2 homolog, was over-expressed in 
70% of colon tumors and in 5/6 normal colon tissues, with low level over-expression 
in normal small intestine, stomach and lung. All clones showing over-expression in colon 
tumor were sequenced and these sequences compared to the most recent GenBank database 
(February 12, 1999). Of the seven clones, three contained sequences that did not share 
significant homology to any known gene sequences, represented by SEQ ID NO: 116, 117 
and 119. To the best of the inventors' knowledge, none of these sequences have been 
previously shown to be present in colon. The determined cDNA sequences of the 
remaining clones (SEQ ID NO: 113-115 and 120) were found to show some homology to 
previously identified genes. 

Further analysis identified a clone which was recovered several times by 
PCR subtraction and by expression screening using a mouse anti-SCID antiserum. The 
determined full length cDNA sequence for this clone is provided in SEQ ID NO: 121, with 
the corresponding predicted amino acid sequence being provided in SEQ ID NO: 122. 
This clone is homologous with the known gene Beta IG-H3, as disclosed in U.S. Patent No. 
5,444,164. Microarray analysis demonstrated this clone to be over-expressed in 75 to 80% 
of colon tumors tested (n=27), with no over-expression in normal colon samples (n=6), but 
with some low level over-expression in other normal tissues tested. 

Further analysis of the PCR-subtraction library described above led to the 
isolation of longer cDNA sequences for the clones of SEQ ID NO: 30, 115, 46, 118, 41, 47, 
38, 113, 14 and 40 (known as C751P, C791P, C792P, C793P, C794P, C795P, C796P, 
C797P, C798P and C799P, respectively). These determined cDNA sequences are provided 
in SEQ ID NO: 123-132, respectively. Additional sequences for the clones C794P and 
C799P are shown in SEQ ID NO: 683 and 684, respectively, and the predicted amino acid 
sequences are shown in SEQ ID NO: 685 and 686, respectively. Still further sequences for 
the clones C794P and C799P are shown in SEQ ID NO: 691 and 690, respectively, and 
the predicted amino acid sequences are shown in SEQ ID NO: 693 and 692, respectively. 

Using the PCR subtraction methodology described above with minor 
modifications, transcripts from a pool of three moderately differentiated colon 
adenocarcinoma samples were subtracted with a set of transcripts from normal brain, 
pancreas, bone marrow, liver, heart, lung, stomach and small intestine. Modifications of 
the above protocol were included at the cDNA digestion steps and in the tester to driver 
hybridization ratios. In a first subtraction, the restriction enzymes PvuII, DraI, MscI and 
StuI were used to digest cDNAs, and the tester to driver ratio was 1:40, as suggested by 
Clontech. In a second subtraction, DraI, MscI and StuI were used for cDNA digestion and 
a tester to driver ratio of 1:76 was used. Following the PCR amplification steps, the 
cDNAs were cloned into pCR2.1 plasmid vector. The determined cDNA sequences of 167 
isolated clones are provided in SEQ ID NO: 205-371. These sequences were compared to 
sequences in the public databases as described above. The sequences of SEQ ID NO: 205, 
207, 210-212, 214, 215, 218, 224-226, 228, 233, 234, 236, 238, 241, 242, 245, 246, 248, 
250, 253, 254, 256, 259, 260, 262, 263, 266, 267, 270-273, 279, 282, 291, 293, 294, 298, 
300, 302, 303, 310-313, 315, 317, 320, 322, 324, 332-335, 345, 347, 356, 358, 361, 362, 
366, 369 and 371 were found to show some homology to previously identified ESTs. The 
remaining sequences were found to show some homology to previously identified genes. 

Using the PCR subtraction technology described above, a cDNA library 
from a pool of primary colon tumors was subtracted with a cDNA library prepared from 
normal tissues, including brain, bone marrow, kidney, heart, lung, liver, pancreas, small 
intestine, stomach and trachea. The determined cDNA sequences for 90 clones isolated in 
this subtraction are provided in SEQ ID NO: 372-461. Comparison of these sequences 
with those in the public databases as described above, revealed no homologies to the 
sequences of SEQ ID NO: 426, 445 and 453. The sequences of SEQ ID NO: 372-378, 
380-404, 406, 409-417, 419-423, 425, 427-429, 433-436, 438-441, 443, 446-451, 454, 455 and 
457-461 showed some homology to previously identified genes, while the sequences of 
SEQ ID NO: 379, 405, 407, 408, 418, 424, 430-432, 437, 442, 444, 452 and 456 showed 
some homology to previously isolated ESTs. 

Using the PCR subtraction methodology described above, a cDNA library 
prepared from a pool of metastatic colon tumors was subtracted with cDNA from a pool of 
normal tissues, namely brain, heart, lung, lymph nodes, PBMC, pancreas, small intestine 
and stomach. The determined cDNA sequences for 82 clones isolated from the subtracted 
library are provided in SEQ ID NO: 487-568 (referred to as contigs 1-56 and 58-83, 
respectively). The sequences of SEQ ID NO: 487, 489, 490, 493-496, 499, 501-509, 
511-518, 520-526, 529-542, 544, 546, 548-552, 554, 555, 557, 558, 560, 562, 563, 566 and 567 
showed some homology to previously identified gene sequences. The sequences of SEQ 
ID NO: 488, 491, 492, 497, 498, 500, 510, 519, 527, 528, 543, 545, 547, 553, 559, 564, 
564 and 568 showed some homology to previously isolated ESTs. 
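
Because the sequence listings above are given as compact ranges, a small helper can be useful for confirming that a stated clone count matches its listing. The Python sketch below is illustrative only; the function name expand_ids is hypothetical, and pairing contigs with SEQ ID NOs in listed order is an interpretation of the word "respectively".

```python
import re


def expand_ids(listing):
    """Expand a listing such as '1-56 and 58-83' into individual integers."""
    ids = []
    # Each match is (start, end); end is an empty string for a single number.
    for start, end in re.findall(r"(\d+)(?:\s*-\s*(\d+))?", listing):
        first = int(start)
        last = int(end) if end else first
        ids.extend(range(first, last + 1))
    return ids


seq_ids = expand_ids("487-568")
contigs = expand_ids("1-56 and 58-83")

# Pair contigs with SEQ ID NOs in listed order.
contig_to_seq = dict(zip(contigs, seq_ids))

print(len(seq_ids), len(contigs), contig_to_seq[58])
```

Both listings expand to 82 entries, in agreement with the 82 clones stated above.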

Example 2 

ISOLATION OF TUMOR POLYPEPTIDES 
USING SCID MOUSE-PASSAGED TUMOR RNA 

Human colon tumor antigens were obtained using SCID mouse-passaged 
colon tumor RNA as follows. Human colon tumor was implanted in SCID mice and 
harvested, as described in Patent Application Serial No. 08/556,659 filed 11/13/95, U.S. 
Patent No. 5,986,170. First strand cDNA was synthesized from poly A+ RNA from three 
SCID mouse-passaged colon tumors using a Lambda ZAP Express cDNA synthesis kit 
(Stratagene). The reactions were pooled and digested with RNase A, T1 and H to cleave 
the RNA and then treated with NaOH to degrade the RNA. The resulting cDNA was 
annealed with biotinylated (Vector Labs, Inc., Burlingame, CA) cDNA from a normal 
resting PBMC plasmid library (constructed from Superscript plasmid System, Gibco BRL), 
and subtracted with streptavidin by phenol/chloroform extraction. Second strand cDNA 
was synthesized from the subtracted first strand cDNA and digested with S1 nuclease 
(Gibco BRL). The cDNA was blunted with Pfu polymerase and EcoRI adaptors 
(Stratagene) were ligated to the ends. The cDNA was phosphorylated with T4 
polynucleotide kinase, digested with restriction endonuclease XhoI, and size selected with 
Sephacryl S-400 (Sigma). Fractions were pooled, ligated to Lambda ZAP Express arms 
(Stratagene) and packaged with Gigapack Gold III extract (Stratagene). Random plaques 
were picked, phagemid was excised and transformed into XLOLR cells (Stratagene), and 
the resulting plasmid DNA (Qiagen Inc., Valencia, CA) was sequenced as described above. 

The determined cDNA sequences for 17 clones isolated as described above 
are provided in SEQ ID NO: 133-151, wherein SEQ ID NO: 133 and 134 represent partial 
sequences of a clone referred to as CoSub-3 and SEQ ID NO: 135 and 136 represent partial 
sequences of a clone referred to as CoSub-13. These sequences were compared with those 
in the public databases as described above. The sequences of SEQ ID NO: 139 and 149 
showed no significant homologies to any previously identified sequences. The sequences 
of SEQ ID NO: 138, 140, 141, 142, 143, 148 and 149 showed some homology to previously 
isolated expressed sequence tags (ESTs). The sequences of SEQ ID NO: 133-137, 144-147, 150 
and 151 showed some homology to previously isolated gene sequences. 

The determined cDNA sequences for an additional 46 clones isolated as 
described above are provided in SEQ ID NO: 569-616, wherein SEQ ID NO: 573 and 574 
represent the 3' and 5' determined cDNA sequences, respectively, for clone CS1-106, and 
SEQ ID NO: 579 and 580 represent the determined 3' and 5' cDNA sequences, 
respectively, for clone CS1-124. Comparison of the isolated sequences with those in the 
public databases revealed no significant homologies to the sequences of SEQ ID NO: 580, 
585, 610 and 613. The sequences of SEQ ID NO: 569, 574-577, 584, 587, 592, 595, 598, 
603 and 608 showed some homology to previously isolated ESTs, while the sequences of 
SEQ ID NO: 570-573, 578, 581-583, 586, 588-591, 593, 594, 596, 597, 599-602, 604-607, 
609, 611, 612 and 614-616 showed some homology to previously isolated gene sequences. 

Example 3 

USE OF MOUSE ANTISERA TO IDENTIFY DNA SEQUENCES 
ENCODING COLON TUMOR ANTIGENS 

This example illustrates the isolation of cDNA sequences encoding colon 
tumor antigens by screening of colon tumor cDNA libraries with mouse anti-tumor sera. 

A cDNA expression library was prepared from SCID mouse-passaged 
human colon tumor poly A+ RNA using a Stratagene (La Jolla, CA) Lambda ZAP Express 
kit, following the manufacturer's instructions. Serum was obtained from the colon 
tumor-bearing SCID mouse. This serum was injected into normal mice to produce anti-colon 
tumor serum. Approximately 600,000 PFUs were screened from the unamplified library 
using this antiserum. Using a goat anti-mouse IgG-A-M (H+L) alkaline phosphatase 
second antibody developed with NBT/BCIP (BRL Labs.), positive plaques were identified. 
Phage was purified and phagemid excised for several clones with inserts in a pBK-CMV 
vector for expression in prokaryotic or eukaryotic cells. 

The determined cDNA sequences for 46 of the isolated clones are provided 
in SEQ ID NO: 152-197. The predicted amino acid sequences for the cDNA sequences of 
SEQ ID NO: 187, 188, 189, 190, 194, 195 and 197 are provided in SEQ ID NO: 198-204, 
respectively. The determined cDNA sequences were compared with those in the public 
database as described above. The sequences of SEQ ID NO: 156, 168, 184, 189, 192 and 
196 showed some homology to previously isolated ESTs. The sequences of SEQ ID NO: 
152-155, 157-167, 169-182, 183, 185-188, 190, 194, 195 and 197 showed some homology 
to previously identified genes. 

The determined cDNA sequences for an additional eleven clones isolated as 
described above are provided in SEQ ID NO: 617-627. Comparison of these sequences 
with those in the public database as described above revealed no known homologies to 
SEQ ID NO: 621 and 623. The sequences of SEQ ID NO: 622 and 626 were found to 
show some homology to previously isolated ESTs, while the sequences of SEQ ID NO: 
617-620, 624, 625 and 627 showed some homology to previously identified genes. 

In further studies, a cDNA library was prepared from SCID mouse-grown 
colon tumors and screened with mouse anti-SCID serum as described above. Briefly, first 
strand cDNA was synthesized from poly A+ RNA from three SCID mouse-grown human 
colon tumors using a Lambda ZAP Express cDNA synthesis kit (Stratagene). The 
reactions were pooled and digested with RNase A, T1 and H to cleave the RNA and then 
treated with NaOH to degrade the RNA. The cDNA was annealed with biotinylated cDNA 
from a normal resting PBMC plasmid library (constructed from Superscript plasmid 
system; Gibco BRL) and subtracted with streptavidin by phenol/chloroform extraction. 
Second strand cDNA was synthesized from the subtracted first strand cDNA and digested 
with S1 nuclease. The cDNA was blunted with Pfu polymerase and EcoRI adaptors were 
ligated to the ends. The cDNA was phosphorylated with T4 polynucleotide kinase, 
digested with restriction endonuclease XhoI, and size selected with Sephacryl S-400 
(Sigma). Fractions were pooled, ligated to Lambda ZAP Express arms (Stratagene) and 
packaged with Gigapack Gold III extract (Stratagene). The resulting library was screened 
with a mouse antiserum raised against serum from SCID mice containing human colon 
tumors, including the three tumors used to prepare the cDNA libraries. 

The determined cDNA for one clone isolated using this procedure is 
provided in SEQ ID NO: 630. This clone was found to show homology to a previously 
identified gene. The amino acid sequence encoded by the clone of SEQ ID NO: 630 is 
provided in SEQ ID NO: 631. 

In subsequent studies, an additional cDNA library was prepared from a 
SCID-passaged human colon tumor and screened with a mouse antiserum raised against 
serum from the SCID mouse containing the colon tumor. The determined cDNA sequences 
for 51 clones isolated in these studies are provided in SEQ ID NO: 632-682. Comparison 
of these sequences with those in the public databases revealed no significant homologies to 
the sequences of SEQ ID NO: 648 and 668. The sequence of SEQ ID NO: 642 showed 
some homology to previously isolated ESTs. The sequences of SEQ ID NO: 632-641, 643- 
647, 649-667 and 669-682 were found to show some homology to previously identified 
genes. SEQ ID NO: 684 and SEQ ID NO: 690 showed homology to human 
NADH/NADPH thyroid oxidase pl38-tox mRNA. 

Example 4 

ISOLATION AND CHARACTERIZATION OF COLON TUMOR 
POLYPEPTIDES BY CONVENTIONAL SUBTRACTION 

Two cDNA libraries were constructed and used to create a subtracted cDNA 
library as follows. 

Using the GibcoBRL Superscript Plasmid System with minor modifications, 
two cDNA libraries were created. The first library, referred to as CTCL, was prepared 
from a pool of mRNA samples from three colon adenocarcinoma tissue samples. Two of 
the samples were described as Dukes' stage C and one as Dukes' stage B. All three 
samples were grade III in histological status. A second library (referred to as 
DriverLibpcDNA3.1+) was prepared from a pool of normal tissues, namely liver, pancreas, 
skin, bone marrow, resting PBMC, stomach and brain. Both libraries were prepared using 
the manufacturer's instructions with the following modifications: an EcoRI-NotI 5' cDNA 
adapter was used instead of the provided reagent; the vector pcDNA3.1(+) (Invitrogen) 
was substituted for the pSPORT vector; and the ligated DNA molecules were transformed 
into ElectroMax DH10B electrocompetent cells. Clones from the libraries were analyzed 
by restriction digest and sequencing to determine average insert size, quality of the library 
and complexity of the library. DNA was prepared from each library and digested. 

The driver DNA was biotinylated and hybridized with the colon library 
tester DNA at a ratio of 10:1. After two rounds of hybridizations, streptavidin incubations 
and extractions, the remaining colon cDNAs were size-selected by column chromatography 
and cloned into the pCMV-Script vector from Stratagene. Clones from this subtracted 
library (referred to as CTCL-S1) were characterized as described above for the 
unsubtracted libraries. 

The determined cDNA sequences for 20 clones isolated from the CTCL-S1 
library are provided in SEQ ID NO: 462-479, 628 and 629. Comparison of these sequences 
with those in the public databases, as described above, revealed no significant homologies 
25 to the sequences of SEQ ID NO: 476, 477 and 479. The remaining sequences showed 
some homology to previously identified genes. 

In further studies, a cDNA library was prepared from a pool of mRNA from 
three metastatic colon adenocarcinomas derived from liver tissue samples. All samples 
were described as Dukes' stage D. Conventional subtraction was performed as described 
above, using the DriverLibpcDNA3.1+ library described above as the driver. The resulting 
subtracted library (referred to as CMCL-S1) was characterized by isolating a set of clones 
for restriction analysis and sequencing. 

The determined cDNA sequences for 7 clones isolated from the CMCL-S1 
library are provided in SEQ ID NO: 480-486. Comparison of these sequences with those in 
the public databases revealed no significant homologies to the sequence of SEQ ID NO: 
483. The sequences of SEQ ID NO: 480-482 and 484-486 were found to show some 
homology to previously identified genes. 

Example 5 
SYNTHESIS OF POLYPEPTIDES 

Polypeptides may be synthesized on a Perkin Elmer/Applied Biosystems 
Division 430A peptide synthesizer using FMOC chemistry with HBTU (O-Benzotriazole-
N,N,N',N'-tetramethyluronium hexafluorophosphate) activation. A Gly-Cys-Gly sequence 
may be attached to the amino terminus of the peptide to provide a method of conjugation, 
binding to an immobilized surface, or labeling of the peptide. Cleavage of the peptides 
from the solid support may be carried out using the following cleavage mixture: 
trifluoroacetic acid:ethanedithiol:thioanisole:water:phenol (40:1:2:2:3). After cleaving for 
2 hours, the peptides may be precipitated in cold methyl-t-butyl-ether. The peptide pellets 
may then be dissolved in water containing 0.1% trifluoroacetic acid (TFA) and lyophilized 
prior to purification by C18 reverse phase HPLC. A gradient of 0%-60% acetonitrile 
(containing 0.1% TFA) in water (containing 0.1% TFA) may be used to elute the peptides. 
Following lyophilization of the pure fractions, the peptides may be characterized using 
electrospray or other types of mass spectrometry and by amino acid analysis. 
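
The 40:1:2:2:3 cleavage mixture may be scaled to any batch size by simple proportion. The Python sketch below is illustrative only; the name component_amounts and the 12-unit batch are hypothetical, and the text does not state whether the parts are by volume or by weight, so the result is expressed in the same unit as the chosen total.

```python
# Parts of each component in the cleavage mixture, as given in the text.
CLEAVAGE_PARTS = {
    "trifluoroacetic acid": 40,
    "ethanedithiol": 1,
    "thioanisole": 2,
    "water": 2,
    "phenol": 3,
}


def component_amounts(total, parts=CLEAVAGE_PARTS):
    """Split `total` among the components in proportion to their parts.

    The unit of each amount is the unit of `total`; whether the ratio is
    by volume or by weight is not specified in the text.
    """
    whole = sum(parts.values())
    return {name: total * share / whole for name, share in parts.items()}


# Hypothetical batch of 12 units (for example, 12 ml if parts are by volume).
amounts = component_amounts(12.0)
for name, amount in amounts.items():
    print(f"{name}: {amount:.2f}")
```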

From the foregoing it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 